Dataset statistics
| Number of variables | 51 |
|---|---|
| Number of observations | 39717 |
| Missing cells | 118497 |
| Missing cells (%) | 5.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 15.5 MiB |
| Average record size in memory | 408.0 B |
Variable types
| Numeric | 25 |
|---|---|
| Categorical | 12 |
| Text | 7 |
| DateTime | 5 |
| Boolean | 2 |
pymnt_plan has constant value "" | Constant |
initial_list_status has constant value "" | Constant |
application_type has constant value "" | Constant |
loan_status is highly imbalanced (51.4%) | Imbalance |
pub_rec is highly imbalanced (86.6%) | Imbalance |
pub_rec_bankruptcies is highly imbalanced (83.7%) | Imbalance |
emp_title has 2459 (6.2%) missing values | Missing |
emp_length has 1075 (2.7%) missing values | Missing |
desc has 12942 (32.6%) missing values | Missing |
mths_since_last_delinq has 25682 (64.7%) missing values | Missing |
mths_since_last_record has 36931 (93.0%) missing values | Missing |
next_pymnt_d has 38577 (97.1%) missing values | Missing |
pub_rec_bankruptcies has 697 (1.8%) missing values | Missing |
annual_inc is highly skewed (γ1 = 30.9491846) | Skewed |
collection_recovery_fee is highly skewed (γ1 = 25.02941576) | Skewed |
id has unique values | Unique |
member_id has unique values | Unique |
url has unique values | Unique |
delinq_2yrs has 35405 (89.1%) zeros | Zeros |
inq_last_6mths has 19300 (48.6%) zeros | Zeros |
mths_since_last_delinq has 443 (1.1%) zeros | Zeros |
mths_since_last_record has 670 (1.7%) zeros | Zeros |
revol_bal has 994 (2.5%) zeros | Zeros |
out_prncp has 38577 (97.1%) zeros | Zeros |
out_prncp_inv has 38577 (97.1%) zeros | Zeros |
total_rec_late_fee has 37671 (94.8%) zeros | Zeros |
recoveries has 35499 (89.4%) zeros | Zeros |
collection_recovery_fee has 35935 (90.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-10 11:29:49.607329 |
|---|---|
| Analysis finished | 2024-04-10 11:31:33.009795 |
| Duration | 1 minute and 43.4 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIQUE 
| Distinct | 39717 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 683131.91 |
| Minimum | 54734 |
|---|---|
| Maximum | 1077501 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 54734 |
|---|---|
| 5-th percentile | 372418.4 |
| Q1 | 516221 |
| median | 665665 |
| Q3 | 837755 |
| 95-th percentile | 1039966.2 |
| Maximum | 1077501 |
| Range | 1022767 |
| Interquartile range (IQR) | 321534 |
Descriptive statistics
| Standard deviation | 210694.13 |
|---|---|
| Coefficient of variation (CV) | 0.30842379 |
| Kurtosis | -0.7298894 |
| Mean | 683131.91 |
| Median Absolute Deviation (MAD) | 160026 |
| Skewness | 0.078807632 |
| Sum | 2.713195 × 1010 |
| Variance | 4.4392018 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 69001 | 1 | < 0.1% |
| 588642 | 1 | < 0.1% |
| 583126 | 1 | < 0.1% |
| 587559 | 1 | < 0.1% |
| 588668 | 1 | < 0.1% |
| 588657 | 1 | < 0.1% |
| 588664 | 1 | < 0.1% |
| 588649 | 1 | < 0.1% |
| 588646 | 1 | < 0.1% |
| 588608 | 1 | < 0.1% |
| Other values (39707) | 39707 |
| Value | Count | Frequency (%) |
| 54734 | 1 | |
| 55742 | 1 | |
| 57245 | 1 | |
| 57416 | 1 | |
| 58915 | 1 | |
| 59006 | 1 | |
| 61390 | 1 | |
| 61419 | 1 | |
| 62102 | 1 | |
| 65426 | 1 |
| Value | Count | Frequency (%) |
| 1077501 | 1 | |
| 1077430 | 1 | |
| 1077175 | 1 | |
| 1076863 | 1 | |
| 1075358 | 1 | |
| 1075269 | 1 | |
| 1072053 | 1 | |
| 1071795 | 1 | |
| 1071570 | 1 | |
| 1070078 | 1 |
member_id
Real number (ℝ)
UNIQUE 
| Distinct | 39717 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 850463.56 |
| Minimum | 70699 |
|---|---|
| Maximum | 1314167 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 70699 |
|---|---|
| 5-th percentile | 388192.4 |
| Q1 | 666780 |
| median | 850812 |
| Q3 | 1047339 |
| 95-th percentile | 1269461.8 |
| Maximum | 1314167 |
| Range | 1243468 |
| Interquartile range (IQR) | 380559 |
Descriptive statistics
| Standard deviation | 265678.31 |
|---|---|
| Coefficient of variation (CV) | 0.31239235 |
| Kurtosis | -0.56296801 |
| Mean | 850463.56 |
| Median Absolute Deviation (MAD) | 190427 |
| Skewness | -0.21241637 |
| Sum | 3.3777861 × 1010 |
| Variance | 7.0584963 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 265533 | 1 | < 0.1% |
| 756244 | 1 | < 0.1% |
| 749343 | 1 | < 0.1% |
| 754885 | 1 | < 0.1% |
| 756270 | 1 | < 0.1% |
| 756259 | 1 | < 0.1% |
| 756267 | 1 | < 0.1% |
| 756242 | 1 | < 0.1% |
| 756248 | 1 | < 0.1% |
| 756206 | 1 | < 0.1% |
| Other values (39707) | 39707 |
| Value | Count | Frequency (%) |
| 70699 | 1 | |
| 73673 | 1 | |
| 74724 | 1 | |
| 76583 | 1 | |
| 80353 | 1 | |
| 80364 | 1 | |
| 84914 | 1 | |
| 85483 | 1 | |
| 86999 | 1 | |
| 89243 | 1 |
| Value | Count | Frequency (%) |
| 1314167 | 1 | |
| 1313524 | 1 | |
| 1311748 | 1 | |
| 1311441 | 1 | |
| 1306957 | 1 | |
| 1306721 | 1 | |
| 1305201 | 1 | |
| 1305008 | 1 | |
| 1304956 | 1 | |
| 1304884 | 1 |
loan_amnt
Real number (ℝ)
| Distinct | 885 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11219.444 |
| Minimum | 500 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5500 |
| median | 10000 |
| Q3 | 15000 |
| 95-th percentile | 25000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 9500 |
Descriptive statistics
| Standard deviation | 7456.6707 |
|---|---|
| Coefficient of variation (CV) | 0.66462035 |
| Kurtosis | 0.76866855 |
| Mean | 11219.444 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 1.0593173 |
| Sum | 4.4560265 × 108 |
| Variance | 55601938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2833 | 7.1% |
| 12000 | 2334 | 5.9% |
| 5000 | 2051 | 5.2% |
| 6000 | 1908 | 4.8% |
| 15000 | 1895 | 4.8% |
| 20000 | 1626 | 4.1% |
| 8000 | 1586 | 4.0% |
| 25000 | 1390 | 3.5% |
| 4000 | 1130 | 2.8% |
| 3000 | 1030 | 2.6% |
| Other values (875) | 21934 |
| Value | Count | Frequency (%) |
| 500 | 5 | < 0.1% |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 900 | 2 | < 0.1% |
| 950 | 1 | < 0.1% |
| 1000 | 301 | |
| 1050 | 4 | < 0.1% |
| 1075 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 679 | |
| 34800 | 2 | < 0.1% |
| 34675 | 1 | < 0.1% |
| 34525 | 1 | < 0.1% |
| 34475 | 5 | < 0.1% |
| 34200 | 1 | < 0.1% |
| 34000 | 15 | < 0.1% |
| 33950 | 9 | < 0.1% |
| 33600 | 6 | < 0.1% |
| 33500 | 2 | < 0.1% |
funded_amnt
Real number (ℝ)
| Distinct | 1041 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10947.713 |
| Minimum | 500 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5400 |
| median | 9600 |
| Q3 | 15000 |
| 95-th percentile | 25000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 9600 |
Descriptive statistics
| Standard deviation | 7187.2387 |
|---|---|
| Coefficient of variation (CV) | 0.65650593 |
| Kurtosis | 0.93755199 |
| Mean | 10947.713 |
| Median Absolute Deviation (MAD) | 4600 |
| Skewness | 1.0817102 |
| Sum | 4.3481032 × 108 |
| Variance | 51656400 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2741 | 6.9% |
| 12000 | 2244 | 5.6% |
| 5000 | 2040 | 5.1% |
| 6000 | 1898 | 4.8% |
| 15000 | 1784 | 4.5% |
| 8000 | 1573 | 4.0% |
| 20000 | 1456 | 3.7% |
| 25000 | 1133 | 2.9% |
| 4000 | 1127 | 2.8% |
| 3000 | 1022 | 2.6% |
| Other values (1031) | 22699 |
| Value | Count | Frequency (%) |
| 500 | 5 | < 0.1% |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 900 | 2 | < 0.1% |
| 950 | 1 | < 0.1% |
| 1000 | 302 | |
| 1050 | 5 | < 0.1% |
| 1075 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 554 | |
| 34800 | 1 | < 0.1% |
| 34675 | 2 | < 0.1% |
| 34525 | 1 | < 0.1% |
| 34475 | 4 | < 0.1% |
| 34250 | 1 | < 0.1% |
| 34000 | 14 | < 0.1% |
| 33950 | 6 | < 0.1% |
| 33600 | 6 | < 0.1% |
| 33500 | 1 | < 0.1% |
funded_amnt_inv
Real number (ℝ)
| Distinct | 7940 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10397.449 |
| Minimum | 0 |
|---|---|
| Maximum | 35000 |
| Zeros | 142 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1873.658 |
| Q1 | 5000 |
| median | 8975 |
| Q3 | 14400 |
| 95-th percentile | 24736.572 |
| Maximum | 35000 |
| Range | 35000 |
| Interquartile range (IQR) | 9400 |
Descriptive statistics
| Standard deviation | 7128.4504 |
|---|---|
| Coefficient of variation (CV) | 0.6855961 |
| Kurtosis | 1.0625444 |
| Mean | 10397.449 |
| Median Absolute Deviation (MAD) | 4200 |
| Skewness | 1.1062129 |
| Sum | 4.1295548 × 108 |
| Variance | 50814806 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 1309 | 3.3% |
| 10000 | 1275 | 3.2% |
| 6000 | 1200 | 3.0% |
| 12000 | 1069 | 2.7% |
| 8000 | 900 | 2.3% |
| 4000 | 813 | 2.0% |
| 3000 | 804 | 2.0% |
| 15000 | 659 | 1.7% |
| 7000 | 600 | 1.5% |
| 2000 | 453 | 1.1% |
| Other values (7930) | 30635 |
| Value | Count | Frequency (%) |
| 0 | 142 | |
| 0.01 | 7 | < 0.1% |
| 0.48 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 18.04 | 1 | < 0.1% |
| 23.99 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 32.33 | 1 | < 0.1% |
| 42.81 | 1 | < 0.1% |
| 50.34 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 135 | |
| 34997.35 | 1 | < 0.1% |
| 34993.66 | 1 | < 0.1% |
| 34993.33 | 1 | < 0.1% |
| 34993.26 | 1 | < 0.1% |
| 34993.2 | 1 | < 0.1% |
| 34990.43 | 1 | < 0.1% |
| 34987.98 | 1 | < 0.1% |
| 34987.27 | 1 | < 0.1% |
| 34977.35 | 1 | < 0.1% |
term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| 36 months | |
|---|---|
| 60 months |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 397170 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 months |
|---|---|
| 2nd row | 36 months |
| 3rd row | 36 months |
| 4th row | 36 months |
| 5th row | 36 months |
Common Values
| Value | Count | Frequency (%) |
| 36 months | 29096 | |
| 60 months | 10621 | 26.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| months | 39717 | |
| 36 | 29096 | |
| 60 | 10621 | 13.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 | |
| 3 | 29096 | 7.3% |
| 0 | 10621 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 238302 | |
| Space Separator | 79434 | 20.0% |
| Decimal Number | 79434 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 39717 | |
| 3 | 29096 | |
| 0 | 10621 | 13.4% |
Space Separator
| Value | Count | Frequency (%) |
| 79434 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 238302 | |
| Common | 158868 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 |
Common
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| 3 | 29096 | 18.3% |
| 0 | 10621 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 397170 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 | |
| 3 | 29096 | 7.3% |
| 0 | 10621 | 2.7% |
int_rate
Text
| Distinct | 371 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.6942871 |
| Min length | 5 |
Characters and Unicode
| Total characters | 226160 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 8.94% |
|---|---|
| 2nd row | 14.26% |
| 3rd row | 11.14% |
| 4th row | 13.17% |
| 5th row | 8.00% |
| Value | Count | Frequency (%) |
| 10.99 | 956 | 2.4% |
| 13.49 | 826 | 2.1% |
| 11.49 | 825 | 2.1% |
| 7.51 | 787 | 2.0% |
| 7.88 | 725 | 1.8% |
| 7.49 | 656 | 1.7% |
| 11.71 | 607 | 1.5% |
| 9.99 | 603 | 1.5% |
| 7.90 | 582 | 1.5% |
| 5.42 | 573 | 1.4% |
| Other values (361) | 32577 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 146726 | |
| Other Punctuation | 79434 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 8.7% |
| 7 | 12132 | 8.3% |
| 6 | 12033 | 8.2% |
| 4 | 11091 | 7.6% |
| 5 | 9947 | 6.8% |
| 3 | 9929 | 6.8% |
| 8 | 9527 | 6.5% |
| 0 | 9245 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 226160 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
installment
Real number (ℝ)
| Distinct | 15383 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 324.56192 |
| Minimum | 15.69 |
|---|---|
| Maximum | 1305.19 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 15.69 |
|---|---|
| 5-th percentile | 71.246 |
| Q1 | 167.02 |
| median | 280.22 |
| Q3 | 430.78 |
| 95-th percentile | 762.996 |
| Maximum | 1305.19 |
| Range | 1289.5 |
| Interquartile range (IQR) | 263.76 |
Descriptive statistics
| Standard deviation | 208.87487 |
|---|---|
| Coefficient of variation (CV) | 0.64355939 |
| Kurtosis | 1.2468013 |
| Mean | 324.56192 |
| Median Absolute Deviation (MAD) | 123.2 |
| Skewness | 1.1284191 |
| Sum | 12890626 |
| Variance | 43628.713 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 311.11 | 68 | 0.2% |
| 180.96 | 59 | 0.1% |
| 311.02 | 54 | 0.1% |
| 150.8 | 48 | 0.1% |
| 368.45 | 46 | 0.1% |
| 372.12 | 45 | 0.1% |
| 330.76 | 43 | 0.1% |
| 339.31 | 42 | 0.1% |
| 301.6 | 41 | 0.1% |
| 317.72 | 41 | 0.1% |
| Other values (15373) | 39230 |
| Value | Count | Frequency (%) |
| 15.69 | 1 | |
| 16.08 | 1 | |
| 16.25 | 1 | |
| 16.31 | 1 | |
| 16.47 | 1 | |
| 19.87 | 1 | |
| 20.22 | 1 | |
| 21.25 | 1 | |
| 21.74 | 1 | |
| 21.81 | 1 |
| Value | Count | Frequency (%) |
| 1305.19 | 1 | < 0.1% |
| 1302.69 | 1 | < 0.1% |
| 1295.21 | 1 | < 0.1% |
| 1288.1 | 2 | < 0.1% |
| 1283.5 | 1 | < 0.1% |
| 1276.6 | 3 | |
| 1272.2 | 1 | < 0.1% |
| 1269.73 | 5 | |
| 1265.16 | 1 | < 0.1% |
| 1263.23 | 1 | < 0.1% |
grade
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| B | |
|---|---|
| A | |
| C | |
| D | |
| E | |
| Other values (2) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39717 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | C |
| 3rd row | B |
| 4th row | D |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 12020 | |
| a | 10085 | |
| c | 8098 | |
| d | 5307 | |
| e | 2842 | 7.2% |
| f | 1049 | 2.6% |
| g | 316 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39717 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39717 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39717 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
sub_grade
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| B3 | |
|---|---|
| A4 | |
| A5 | |
| B5 | |
| B4 | 2512 |
| Other values (30) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 79434 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A5 |
|---|---|
| 2nd row | C5 |
| 3rd row | B1 |
| 4th row | D2 |
| 5th row | A3 |
Common Values
| Value | Count | Frequency (%) |
| B3 | 2917 | 7.3% |
| A4 | 2886 | 7.3% |
| A5 | 2742 | 6.9% |
| B5 | 2704 | 6.8% |
| B4 | 2512 | 6.3% |
| C1 | 2136 | 5.4% |
| B2 | 2057 | 5.2% |
| C2 | 2011 | 5.1% |
| B1 | 1830 | 4.6% |
| A3 | 1810 | 4.6% |
| Other values (25) | 16112 |
Length
| Value | Count | Frequency (%) |
| b3 | 2917 | 7.3% |
| a4 | 2886 | 7.3% |
| a5 | 2742 | 6.9% |
| b5 | 2704 | 6.8% |
| b4 | 2512 | 6.3% |
| c1 | 2136 | 5.4% |
| b2 | 2057 | 5.2% |
| c2 | 2011 | 5.1% |
| b1 | 1830 | 4.6% |
| a3 | 1810 | 4.6% |
| Other values (25) | 16112 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| 4 | 8293 | |
| 3 | 8215 | |
| C | 8098 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 | |
| D | 5307 | |
| E | 2842 | 3.6% |
| Other values (2) | 1365 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39717 | |
| Decimal Number | 39717 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 8293 | |
| 3 | 8215 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39717 | |
| Common | 39717 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Common
| Value | Count | Frequency (%) |
| 4 | 8293 | |
| 3 | 8215 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79434 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| 4 | 8293 | |
| 3 | 8215 | |
| C | 8098 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 | |
| D | 5307 | |
| E | 2842 | 3.6% |
| Other values (2) | 1365 | 1.7% |
emp_title
Text
MISSING 
| Distinct | 28820 |
|---|---|
| Distinct (%) | 77.4% |
| Missing | 2459 |
| Missing (%) | 6.2% |
| Memory size | 310.4 KiB |
Length
| Max length | 78 |
|---|---|
| Median length | 55 |
| Mean length | 18.379784 |
| Min length | 2 |
Characters and Unicode
| Total characters | 684794 |
|---|---|
| Distinct characters | 96 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 25641 ? |
|---|---|
| Unique (%) | 68.8% |
Sample
| 1st row | Infotrieve, Inc. |
|---|---|
| 2nd row | UBS |
| 3rd row | kmex/univision |
| 4th row | GAP |
| 5th row | State of Michigan |
| Value | Count | Frequency (%) |
| inc | 3197 | 3.2% |
| of | 3008 | 3.0% |
| 1208 | 1.2% | |
| and | 963 | 1.0% |
| center | 818 | 0.8% |
| bank | 805 | 0.8% |
| county | 803 | 0.8% |
| services | 795 | 0.8% |
| school | 750 | 0.7% |
| the | 747 | 0.7% |
| Other values (18882) | 87491 |
Most occurring characters
| Value | Count | Frequency (%) |
| 64766 | 9.5% | |
| e | 55954 | 8.2% |
| a | 43836 | 6.4% |
| n | 42641 | 6.2% |
| o | 42586 | 6.2% |
| i | 40491 | 5.9% |
| r | 40067 | 5.9% |
| t | 38580 | 5.6% |
| s | 30254 | 4.4% |
| l | 25923 | 3.8% |
| Other values (86) | 259696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 489338 | |
| Uppercase Letter | 119545 | 17.5% |
| Space Separator | 64766 | 9.5% |
| Other Punctuation | 8798 | 1.3% |
| Dash Punctuation | 1031 | 0.2% |
| Decimal Number | 968 | 0.1% |
| Open Punctuation | 159 | < 0.1% |
| Close Punctuation | 156 | < 0.1% |
| Math Symbol | 21 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
| Other values (5) | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14579 | 12.2% |
| S | 13325 | 11.1% |
| A | 8885 | 7.4% |
| I | 7566 | 6.3% |
| M | 6518 | 5.5% |
| P | 6077 | 5.1% |
| T | 5691 | 4.8% |
| L | 5561 | 4.7% |
| E | 5241 | 4.4% |
| D | 5056 | 4.2% |
| Other values (18) | 41046 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 55954 | |
| a | 43836 | |
| n | 42641 | |
| o | 42586 | |
| i | 40491 | 8.3% |
| r | 40067 | 8.2% |
| t | 38580 | 7.9% |
| s | 30254 | 6.2% |
| l | 25923 | 5.3% |
| c | 23099 | 4.7% |
| Other values (17) | 105907 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4253 | |
| , | 2194 | |
| & | 1301 | 14.8% |
| ' | 652 | 7.4% |
| / | 311 | 3.5% |
| # | 36 | 0.4% |
| @ | 10 | 0.1% |
| : | 9 | 0.1% |
| ! | 8 | 0.1% |
| " | 8 | 0.1% |
| Other values (5) | 16 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 192 | |
| 2 | 161 | |
| 3 | 155 | |
| 0 | 98 | |
| 4 | 91 | |
| 5 | 72 | 7.4% |
| 9 | 62 | 6.4% |
| 6 | 58 | 6.0% |
| 7 | 46 | 4.8% |
| 8 | 33 | 3.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 18 | |
| | | 2 | 9.5% |
| < | 1 | 4.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 158 | |
| [ | 1 | 0.6% |
Control
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 | |
| $ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 64766 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1031 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 156 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 2 |
Other Number
| Value | Count | Frequency (%) |
| ² | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 608883 | |
| Common | 75911 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 55954 | 9.2% |
| a | 43836 | 7.2% |
| n | 42641 | 7.0% |
| o | 42586 | 7.0% |
| i | 40491 | 6.7% |
| r | 40067 | 6.6% |
| t | 38580 | 6.3% |
| s | 30254 | 5.0% |
| l | 25923 | 4.3% |
| c | 23099 | 3.8% |
| Other values (45) | 225452 |
Common
| Value | Count | Frequency (%) |
| 64766 | ||
| . | 4253 | 5.6% |
| , | 2194 | 2.9% |
| & | 1301 | 1.7% |
| - | 1031 | 1.4% |
| ' | 652 | 0.9% |
| / | 311 | 0.4% |
| 1 | 192 | 0.3% |
| 2 | 161 | 0.2% |
| ( | 158 | 0.2% |
| Other values (31) | 892 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 684780 | |
| None | 14 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 64766 | 9.5% | |
| e | 55954 | 8.2% |
| a | 43836 | 6.4% |
| n | 42641 | 6.2% |
| o | 42586 | 6.2% |
| i | 40491 | 5.9% |
| r | 40067 | 5.9% |
| t | 38580 | 5.6% |
| s | 30254 | 4.4% |
| l | 25923 | 3.8% |
| Other values (77) | 259682 |
None
| Value | Count | Frequency (%) |
| Ã | 3 | |
| © | 2 | |
| ² | 2 | |
| Â | 2 | |
| | 1 | 7.1% |
| ¢ | 1 | 7.1% |
| â | 1 | 7.1% |
| | 1 | 7.1% |
| ¡ | 1 | 7.1% |
emp_length
Categorical
MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1075 |
| Missing (%) | 2.7% |
| Memory size | 310.4 KiB |
| 10+ years | |
|---|---|
| < 1 year | |
| 2 years | |
| 3 years | |
| 4 years | |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.4943067 |
| Min length | 6 |
Characters and Unicode
| Total characters | 289595 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | < 1 year |
|---|---|
| 2nd row | 3 years |
| 3rd row | < 1 year |
| 4th row | 10+ years |
| 5th row | < 1 year |
Common Values
| Value | Count | Frequency (%) |
| 10+ years | 8879 | |
| < 1 year | 4583 | |
| 2 years | 4388 | |
| 3 years | 4095 | |
| 4 years | 3436 | 8.7% |
| 5 years | 3282 | 8.3% |
| 1 year | 3240 | 8.2% |
| 6 years | 2229 | 5.6% |
| 7 years | 1773 | 4.5% |
| 8 years | 1479 | 3.7% |
Length
| Value | Count | Frequency (%) |
| years | 30819 | |
| 10 | 8879 | 10.8% |
| 1 | 7823 | 9.6% |
| year | 7823 | 9.6% |
| 4583 | 5.6% | |
| 2 | 4388 | 5.4% |
| 3 | 4095 | 5.0% |
| 4 | 3436 | 4.2% |
| 5 | 3282 | 4.0% |
| 6 | 2229 | 2.7% |
| Other values (3) | 4510 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 43225 | ||
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 | |
| 1 | 16702 | 5.8% |
| 0 | 8879 | 3.1% |
| + | 8879 | 3.1% |
| < | 4583 | 1.6% |
| Other values (8) | 21940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 185387 | |
| Decimal Number | 47521 | 16.4% |
| Space Separator | 43225 | 14.9% |
| Math Symbol | 13462 | 4.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16702 | |
| 0 | 8879 | |
| 2 | 4388 | 9.2% |
| 3 | 4095 | 8.6% |
| 4 | 3436 | 7.2% |
| 5 | 3282 | 6.9% |
| 6 | 2229 | 4.7% |
| 7 | 1773 | 3.7% |
| 8 | 1479 | 3.1% |
| 9 | 1258 | 2.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 8879 | |
| < | 4583 |
Space Separator
| Value | Count | Frequency (%) |
| 43225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 185387 | |
| Common | 104208 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 43225 | ||
| 1 | 16702 | 16.0% |
| 0 | 8879 | 8.5% |
| + | 8879 | 8.5% |
| < | 4583 | 4.4% |
| 2 | 4388 | 4.2% |
| 3 | 4095 | 3.9% |
| 4 | 3436 | 3.3% |
| 5 | 3282 | 3.1% |
| 6 | 2229 | 2.1% |
| Other values (3) | 4510 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 289595 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 43225 | ||
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 | |
| 1 | 16702 | 5.8% |
| 0 | 8879 | 3.1% |
| + | 8879 | 3.1% |
| < | 4583 | 1.6% |
| Other values (8) | 21940 |
home_ownership
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN | |
| OTHER | 98 |
| NONE | 3 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 5.7039555 |
| Min length | 3 |
Characters and Unicode
| Total characters | 226544 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MORTGAGE |
|---|---|
| 2nd row | MORTGAGE |
| 3rd row | MORTGAGE |
| 4th row | RENT |
| 5th row | MORTGAGE |
Common Values
| Value | Count | Frequency (%) |
| RENT | 18899 | |
| MORTGAGE | 17659 | |
| OWN | 3058 | 7.7% |
| OTHER | 98 | 0.2% |
| NONE | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| rent | 18899 | |
| mortgage | 17659 | |
| own | 3058 | 7.7% |
| other | 98 | 0.2% |
| none | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 226544 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226544 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226544 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
annual_inc
Real number (ℝ)
SKEWED 
| Distinct | 5318 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68968.926 |
| Minimum | 4000 |
|---|---|
| Maximum | 6000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 4000 |
|---|---|
| 5-th percentile | 24000 |
| Q1 | 40404 |
| median | 59000 |
| Q3 | 82300 |
| 95-th percentile | 142000 |
| Maximum | 6000000 |
| Range | 5996000 |
| Interquartile range (IQR) | 41896 |
Descriptive statistics
| Standard deviation | 63793.766 |
|---|---|
| Coefficient of variation (CV) | 0.92496388 |
| Kurtosis | 2302.7378 |
| Mean | 68968.926 |
| Median Absolute Deviation (MAD) | 20000 |
| Skewness | 30.949185 |
| Sum | 2.7392388 × 109 |
| Variance | 4.0696446 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60000 | 1505 | 3.8% |
| 50000 | 1057 | 2.7% |
| 40000 | 876 | 2.2% |
| 45000 | 830 | 2.1% |
| 30000 | 825 | 2.1% |
| 75000 | 811 | 2.0% |
| 65000 | 803 | 2.0% |
| 70000 | 733 | 1.8% |
| 48000 | 723 | 1.8% |
| 80000 | 662 | 1.7% |
| Other values (5308) | 30892 |
| Value | Count | Frequency (%) |
| 4000 | 1 | < 0.1% |
| 4080 | 1 | < 0.1% |
| 4200 | 2 | < 0.1% |
| 4800 | 4 | |
| 4888 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 5500 | 1 | < 0.1% |
| 6000 | 5 | |
| 7000 | 1 | < 0.1% |
| 7200 | 4 |
| Value | Count | Frequency (%) |
| 6000000 | 1 | < 0.1% |
| 3900000 | 1 | < 0.1% |
| 2039784 | 1 | < 0.1% |
| 1900000 | 1 | < 0.1% |
| 1782000 | 1 | < 0.1% |
| 1440000 | 1 | < 0.1% |
| 1362000 | 1 | < 0.1% |
| 1250000 | 1 | < 0.1% |
| 1200000 | 4 | |
| 1176000 | 1 | < 0.1% |
verification_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Not Verified | |
|---|---|
| Verified | |
| Source Verified |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.464335 |
| Min length | 8 |
Characters and Unicode
| Total characters | 455329 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Verified |
|---|---|
| 2nd row | Not Verified |
| 3rd row | Not Verified |
| 4th row | Verified |
| 5th row | Not Verified |
Common Values
| Value | Count | Frequency (%) |
| Not Verified | 16921 | |
| Verified | 12809 | |
| Source Verified | 9987 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| verified | 39717 | |
| not | 16921 | |
| source | 9987 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 5.9% |
| 26908 | 5.9% | |
| N | 16921 | 3.7% |
| t | 16921 | 3.7% |
| Other values (3) | 29961 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 361796 | |
| Uppercase Letter | 66625 | 14.6% |
| Space Separator | 26908 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 7.4% |
| t | 16921 | 4.7% |
| u | 9987 | 2.8% |
| c | 9987 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 39717 | |
| N | 16921 | |
| S | 9987 | 15.0% |
Space Separator
| Value | Count | Frequency (%) |
| 26908 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 428421 | |
| Common | 26908 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 6.3% |
| N | 16921 | 3.9% |
| t | 16921 | 3.9% |
| S | 9987 | 2.3% |
| Other values (2) | 19974 | 4.7% |
Common
| Value | Count | Frequency (%) |
| 26908 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 455329 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 5.9% |
| 26908 | 5.9% | |
| N | 16921 | 3.7% |
| t | 16921 | 3.7% |
| Other values (3) | 29961 | 6.6% |
issue_d
Date
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Minimum | 2007-01-06 00:00:00 |
|---|---|
| Maximum | 2011-01-12 00:00:00 |
loan_status
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Fully Paid | |
|---|---|
| Charged Off | |
| Current | 1140 |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.055568 |
| Min length | 7 |
Characters and Unicode
| Total characters | 399377 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fully Paid |
|---|---|
| 2nd row | Fully Paid |
| 3rd row | Charged Off |
| 4th row | Fully Paid |
| 5th row | Fully Paid |
Common Values
| Value | Count | Frequency (%) |
| Fully Paid | 32950 | |
| Charged Off | 5627 | 14.2% |
| Current | 1140 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| fully | 32950 | |
| paid | 32950 | |
| charged | 5627 | 7.2% |
| off | 5627 | 7.2% |
| current | 1140 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 65900 | |
| 38577 | ||
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 2.8% |
| Other values (8) | 40602 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 282506 | |
| Uppercase Letter | 78294 | 19.6% |
| Space Separator | 38577 | 9.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 65900 | |
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| y | 32950 | |
| i | 32950 | |
| f | 11254 | 4.0% |
| r | 7907 | 2.8% |
| e | 6767 | 2.4% |
| g | 5627 | 2.0% |
| Other values (3) | 7907 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 32950 | |
| P | 32950 | |
| C | 6767 | 8.6% |
| O | 5627 | 7.2% |
Space Separator
| Value | Count | Frequency (%) |
| 38577 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 360800 | |
| Common | 38577 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 65900 | |
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 3.1% |
| r | 7907 | 2.2% |
| Other values (7) | 32695 |
Common
| Value | Count | Frequency (%) |
| 38577 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399377 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 65900 | |
| 38577 | ||
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 2.8% |
| Other values (8) | 40602 |
pymnt_plan
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.9 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 39717 |
url
Text
UNIQUE 
| Distinct | 39717 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 63 |
| Mean length | 63.108367 |
| Min length | 62 |
Characters and Unicode
| Total characters | 2506475 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 39717 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://lendingclub.com/browse/loanDetail.action?loan_id=69001 |
|---|---|
| 2nd row | https://lendingclub.com/browse/loanDetail.action?loan_id=59006 |
| 3rd row | https://lendingclub.com/browse/loanDetail.action?loan_id=65426 |
| 4th row | https://lendingclub.com/browse/loanDetail.action?loan_id=68926 |
| 5th row | https://lendingclub.com/browse/loanDetail.action?loan_id=69251 |
| Value | Count | Frequency (%) |
| https://lendingclub.com/browse/loandetail.action?loan_id=69001 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=69924 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=281384 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=281565 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=281651 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=65426 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=68926 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=69251 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=65640 | 1 | < 0.1% |
| https://lendingclub.com/browse/loandetail.action?loan_id=69828 | 1 | < 0.1% |
| Other values (39707) | 39707 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 198585 | 7.9% |
| l | 198585 | 7.9% |
| n | 198585 | 7.9% |
| a | 158868 | 6.3% |
| t | 158868 | 6.3% |
| / | 158868 | 6.3% |
| i | 158868 | 6.3% |
| c | 119151 | 4.8% |
| e | 119151 | 4.8% |
| . | 79434 | 3.2% |
| Other values (25) | 957512 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1826982 | |
| Other Punctuation | 317736 | 12.7% |
| Decimal Number | 242606 | 9.7% |
| Uppercase Letter | 39717 | 1.6% |
| Connector Punctuation | 39717 | 1.6% |
| Math Symbol | 39717 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 198585 | |
| l | 198585 | |
| n | 198585 | |
| a | 158868 | |
| t | 158868 | |
| i | 158868 | |
| c | 119151 | 6.5% |
| e | 119151 | 6.5% |
| b | 79434 | 4.3% |
| d | 79434 | 4.3% |
| Other values (8) | 357453 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 26616 | |
| 6 | 26607 | |
| 7 | 26037 | |
| 8 | 25774 | |
| 4 | 25584 | |
| 1 | 24160 | |
| 0 | 23856 | |
| 3 | 22052 | |
| 9 | 21694 | |
| 2 | 20226 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 158868 | |
| . | 79434 | |
| ? | 39717 | 12.5% |
| : | 39717 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 39717 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 39717 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 39717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1866699 | |
| Common | 639776 | 25.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 198585 | |
| l | 198585 | |
| n | 198585 | |
| a | 158868 | 8.5% |
| t | 158868 | 8.5% |
| i | 158868 | 8.5% |
| c | 119151 | 6.4% |
| e | 119151 | 6.4% |
| b | 79434 | 4.3% |
| d | 79434 | 4.3% |
| Other values (9) | 397170 |
Common
| Value | Count | Frequency (%) |
| / | 158868 | |
| . | 79434 | |
| ? | 39717 | 6.2% |
| _ | 39717 | 6.2% |
| = | 39717 | 6.2% |
| : | 39717 | 6.2% |
| 5 | 26616 | 4.2% |
| 6 | 26607 | 4.2% |
| 7 | 26037 | 4.1% |
| 8 | 25774 | 4.0% |
| Other values (6) | 137572 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2506475 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 198585 | 7.9% |
| l | 198585 | 7.9% |
| n | 198585 | 7.9% |
| a | 158868 | 6.3% |
| t | 158868 | 6.3% |
| / | 158868 | 6.3% |
| i | 158868 | 6.3% |
| c | 119151 | 4.8% |
| e | 119151 | 4.8% |
| . | 79434 | 3.2% |
| Other values (25) | 957512 |
desc
Text
MISSING 
| Distinct | 26526 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 12942 |
| Missing (%) | 32.6% |
| Memory size | 310.4 KiB |
Length
| Max length | 3988 |
|---|---|
| Median length | 2248 |
| Mean length | 426.5256 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11420223 |
|---|---|
| Distinct characters | 142 |
| Distinct categories | 17 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 26499 ? |
|---|---|
| Unique (%) | 99.0% |
Sample
| 1st row | Taking advantage of excellent credit to pay off credit card |
|---|---|
| 2nd row | I am seeking to refinance a credit account which I closed with a balance when I rejected the new terms of the cardmember agreement. This closed account is adversely affecting my credit utilization percentage and I would prefer to move it to a fixed-rate loan. I am a software developer who has been in a stable position with the same company since 2004. I am up-to-date on all payments and am seeking only to reduce the interest rate of this debt. Thank you for your consideration. |
| 3rd row | We currently have one car that is 19 years old and one that is 8 years old. The 19 year old car, which is the car my husband drives to his job at a local university, was just given about a month to live by our mechanic. We've gotten an amazing amount of use out of it but we will need to be sure to get a reliable vehicle before that one gives out. We hope to be able to donate it with some life left in it to a local non-profit. That is what we have done in the past with our old cars. Our mechanic will help us find a used car in great shape for around $10,000. We have saved about half of that but we really need to make a purchase soon. It would be fabulous to get a loan for a lower percentage rate than what our credit union offers. Currently that is probably about 11% for older vehicles. Thanks for considering us. |
| 4th row | I need a loan to cover moving expenses such as buying new furniture, deposit on the apt etc. |
| 5th row | Looking to pay bills with a lower rate and try a new type of lending. Please note my perfect credit history and ability to pay the account. Many Thanks Heather |
| Value | Count | Frequency (%) |
| i | 77512 | 3.8% |
| to | 71096 | 3.5% |
| a | 54855 | 2.7% |
| the | 54340 | 2.7% |
| and | 54329 | 2.6% |
| my | 51308 | 2.5% |
| on | 49132 | 2.4% |
| 37238 | 1.8% | |
| for | 32774 | 1.6% |
| have | 32490 | 1.6% |
| Other values (53986) | 1535184 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2121185 | ||
| e | 953962 | 8.4% |
| a | 714029 | 6.3% |
| o | 709011 | 6.2% |
| t | 649103 | 5.7% |
| n | 612135 | 5.4% |
| r | 589058 | 5.2% |
| i | 496003 | 4.3% |
| s | 426272 | 3.7% |
| d | 397984 | 3.5% |
| Other values (132) | 3751481 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8135698 | |
| Space Separator | 2121258 | 18.6% |
| Decimal Number | 346663 | 3.0% |
| Other Punctuation | 326744 | 2.9% |
| Uppercase Letter | 302770 | 2.7% |
| Math Symbol | 140645 | 1.2% |
| Currency Symbol | 16745 | 0.1% |
| Dash Punctuation | 13032 | 0.1% |
| Close Punctuation | 7337 | 0.1% |
| Open Punctuation | 6727 | 0.1% |
| Other values (7) | 2604 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 94559 | |
| B | 34778 | 11.5% |
| T | 28474 | 9.4% |
| A | 15522 | 5.1% |
| C | 14303 | 4.7% |
| M | 14262 | 4.7% |
| S | 9642 | 3.2% |
| E | 9211 | 3.0% |
| W | 8830 | 2.9% |
| L | 8656 | 2.9% |
| Other values (21) | 64533 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 953962 | |
| a | 714029 | 8.8% |
| o | 709011 | 8.7% |
| t | 649103 | 8.0% |
| n | 612135 | 7.5% |
| r | 589058 | 7.2% |
| i | 496003 | 6.1% |
| s | 426272 | 5.2% |
| d | 397984 | 4.9% |
| l | 355636 | 4.4% |
| Other values (18) | 2232505 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 120658 | |
| / | 116441 | |
| , | 50144 | |
| ' | 13317 | 4.1% |
| ! | 6738 | 2.1% |
| % | 5704 | 1.7% |
| : | 5281 | 1.6% |
| ; | 3357 | 1.0% |
| & | 2616 | 0.8% |
| " | 801 | 0.2% |
| Other values (10) | 1687 | 0.5% |
Control
| Value | Count | Frequency (%) |
| 1287 | ||
| | 411 | 19.3% |
| | 191 | 9.0% |
| | 38 | 1.8% |
| | 37 | 1.7% |
| | 35 | 1.6% |
| | 27 | 1.3% |
| | 27 | 1.3% |
| | 23 | 1.1% |
| | 15 | 0.7% |
| Other values (9) | 37 | 1.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 104184 | |
| 1 | 97487 | |
| 2 | 36710 | 10.6% |
| 5 | 21465 | 6.2% |
| 3 | 17828 | 5.1% |
| 9 | 16291 | 4.7% |
| 4 | 13709 | 4.0% |
| 6 | 13182 | 3.8% |
| 7 | 12927 | 3.7% |
| 8 | 12880 | 3.7% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 84845 | |
| < | 53870 | |
| + | 984 | 0.7% |
| = | 615 | 0.4% |
| ~ | 290 | 0.2% |
| ¬ | 31 | < 0.1% |
| | | 10 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 96 | |
| © | 15 | 13.0% |
| � | 2 | 1.7% |
| ® | 2 | 1.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13017 | |
| — | 9 | 0.1% |
| – | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7290 | |
| ] | 44 | 0.6% |
| } | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6681 | |
| [ | 44 | 0.7% |
| { | 2 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 13 | |
| ^ | 5 | 26.3% |
| ¯ | 1 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2121185 | ||
| 73 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 16671 | |
| ¢ | 74 | 0.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 78 | |
| ” | 18 | 18.8% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 18 | |
| ‘ | 3 | 14.3% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 6 | |
| ¾ | 2 | 25.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8438468 | |
| Common | 2981755 | 26.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2121185 | ||
| . | 120658 | 4.0% |
| / | 116441 | 3.9% |
| 0 | 104184 | 3.5% |
| 1 | 97487 | 3.3% |
| > | 84845 | 2.8% |
| < | 53870 | 1.8% |
| , | 50144 | 1.7% |
| 2 | 36710 | 1.2% |
| 5 | 21465 | 0.7% |
| Other values (73) | 174766 | 5.9% |
Latin
| Value | Count | Frequency (%) |
| e | 953962 | 11.3% |
| a | 714029 | 8.5% |
| o | 709011 | 8.4% |
| t | 649103 | 7.7% |
| n | 612135 | 7.3% |
| r | 589058 | 7.0% |
| i | 496003 | 5.9% |
| s | 426272 | 5.1% |
| d | 397984 | 4.7% |
| l | 355636 | 4.2% |
| Other values (49) | 2535275 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11418228 | |
| None | 1834 | < 0.1% |
| Punctuation | 159 | < 0.1% |
| Specials | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2121185 | ||
| e | 953962 | 8.4% |
| a | 714029 | 6.3% |
| o | 709011 | 6.2% |
| t | 649103 | 5.7% |
| n | 612135 | 5.4% |
| r | 589058 | 5.2% |
| i | 496003 | 4.3% |
| s | 426272 | 3.7% |
| d | 397984 | 3.5% |
| Other values (86) | 3749486 |
None
| Value | Count | Frequency (%) |
| â | 438 | |
| | 411 | |
| | 191 | |
| Â | 127 | 6.9% |
| Ã | 97 | 5.3% |
| ¦ | 96 | 5.2% |
| ¢ | 74 | 4.0% |
| 73 | 4.0% | |
| | 38 | 2.1% |
| | 37 | 2.0% |
| Other values (27) | 252 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 78 | |
| … | 19 | 11.9% |
| “ | 18 | 11.3% |
| ” | 18 | 11.3% |
| — | 9 | 5.7% |
| • | 8 | 5.0% |
| – | 6 | 3.8% |
| ‘ | 3 | 1.9% |
Specials
| Value | Count | Frequency (%) |
| � | 2 |
purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| other | |
| home_improvement | |
| major_purchase | |
| Other values (9) |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.736183 |
| Min length | 3 |
Characters and Unicode
| Total characters | 545560 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | credit_card |
| 3rd row | car |
| 4th row | moving |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
| debt_consolidation | 18641 | |
| credit_card | 5130 | 12.9% |
| other | 3993 | 10.1% |
| home_improvement | 2976 | 7.5% |
| major_purchase | 2187 | 5.5% |
| small_business | 1828 | 4.6% |
| car | 1549 | 3.9% |
| wedding | 947 | 2.4% |
| medical | 693 | 1.7% |
| moving | 583 | 1.5% |
| Other values (4) | 1190 | 3.0% |
Length
| Value | Count | Frequency (%) |
| debt_consolidation | 18641 | |
| credit_card | 5130 | 12.9% |
| other | 3993 | 10.1% |
| home_improvement | 2976 | 7.5% |
| major_purchase | 2187 | 5.5% |
| small_business | 1828 | 4.6% |
| car | 1549 | 3.9% |
| wedding | 947 | 2.4% |
| medical | 693 | 1.7% |
| moving | 583 | 1.5% |
| Other values (4) | 1190 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | 8.0% |
| c | 34036 | 6.2% |
| a | 33730 | 6.2% |
| _ | 30865 | 5.7% |
| s | 28521 | 5.2% |
| Other values (12) | 109901 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 514695 | |
| Connector Punctuation | 30865 | 5.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | |
| c | 34036 | 6.6% |
| a | 33730 | 6.6% |
| s | 28521 | 5.5% |
| l | 23418 | 4.5% |
| Other values (11) | 86483 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 30865 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 514695 | |
| Common | 30865 | 5.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | |
| c | 34036 | 6.6% |
| a | 33730 | 6.6% |
| s | 28521 | 5.5% |
| l | 23418 | 4.5% |
| Other values (11) | 86483 |
Common
| Value | Count | Frequency (%) |
| _ | 30865 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 545560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | 8.0% |
| c | 34036 | 6.2% |
| a | 33730 | 6.2% |
| _ | 30865 | 5.7% |
| s | 28521 | 5.2% |
| Other values (12) | 109901 |
title
Text
| Distinct | 19615 |
|---|---|
| Distinct (%) | 49.4% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 310.4 KiB |
Length
| Max length | 80 |
|---|---|
| Median length | 72 |
| Mean length | 17.187327 |
| Min length | 1 |
Characters and Unicode
| Total characters | 682440 |
|---|---|
| Distinct characters | 108 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 17624 ? |
|---|---|
| Unique (%) | 44.4% |
Sample
| 1st row | Revolving Debt |
|---|---|
| 2nd row | Rejecting new cardmember agreement |
| 3rd row | djp |
| 4th row | tee_cee |
| 5th row | NewOrganic |
| Value | Count | Frequency (%) |
| loan | 10895 | 10.4% |
| debt | 9245 | 8.8% |
| consolidation | 8622 | 8.2% |
| credit | 4604 | 4.4% |
| card | 3341 | 3.2% |
| personal | 2043 | 2.0% |
| home | 1875 | 1.8% |
| pay | 1344 | 1.3% |
| off | 1259 | 1.2% |
| my | 1133 | 1.1% |
| Other values (8935) | 60203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 66029 | 9.7% | |
| o | 65729 | 9.6% |
| n | 55657 | 8.2% |
| e | 54557 | 8.0% |
| a | 50167 | 7.4% |
| i | 43822 | 6.4% |
| t | 42600 | 6.2% |
| d | 30679 | 4.5% |
| r | 29153 | 4.3% |
| s | 28544 | 4.2% |
| Other values (98) | 215503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 521300 | |
| Uppercase Letter | 83242 | 12.2% |
| Space Separator | 66029 | 9.7% |
| Decimal Number | 5995 | 0.9% |
| Other Punctuation | 4442 | 0.7% |
| Dash Punctuation | 824 | 0.1% |
| Connector Punctuation | 213 | < 0.1% |
| Close Punctuation | 104 | < 0.1% |
| Currency Symbol | 94 | < 0.1% |
| Math Symbol | 92 | < 0.1% |
| Other values (5) | 105 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 65729 | |
| n | 55657 | |
| e | 54557 | |
| a | 50167 | |
| i | 43822 | |
| t | 42600 | |
| d | 30679 | 5.9% |
| r | 29153 | 5.6% |
| s | 28544 | 5.5% |
| l | 26300 | 5.0% |
| Other values (18) | 94092 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 18509 | |
| L | 10335 | |
| D | 9244 | |
| P | 5641 | 6.8% |
| R | 3732 | 4.5% |
| M | 3256 | 3.9% |
| S | 3227 | 3.9% |
| B | 3116 | 3.7% |
| H | 2910 | 3.5% |
| I | 2885 | 3.5% |
| Other values (18) | 20387 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 1123 | |
| ' | 982 | |
| . | 712 | |
| / | 538 | |
| , | 435 | 9.8% |
| & | 328 | 7.4% |
| % | 95 | 2.1% |
| : | 64 | 1.4% |
| " | 56 | 1.3% |
| # | 25 | 0.6% |
| Other values (5) | 84 | 1.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1691 | |
| 0 | 1677 | |
| 2 | 1105 | |
| 3 | 299 | 5.0% |
| 5 | 256 | 4.3% |
| 9 | 254 | 4.2% |
| 4 | 216 | 3.6% |
| 6 | 178 | 3.0% |
| 8 | 169 | 2.8% |
| 7 | 150 | 2.5% |
Control
| Value | Count | Frequency (%) |
| | 4 | |
| | 4 | |
| | 4 | |
| 2 | ||
| | 2 | |
| | 1 | 5.3% |
| | 1 | 5.3% |
| 1 | 5.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 53 | |
| = | 19 | 20.7% |
| < | 9 | 9.8% |
| > | 8 | 8.7% |
| ~ | 2 | 2.2% |
| | | 1 | 1.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 | |
| ` | 1 | |
| ^ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 100 | |
| ] | 4 | 3.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 77 | |
| [ | 3 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 66029 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 824 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 213 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 94 |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 2 |
Other Number
| Value | Count | Frequency (%) |
| ³ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 604542 | |
| Common | 77898 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 65729 | 10.9% |
| n | 55657 | 9.2% |
| e | 54557 | 9.0% |
| a | 50167 | 8.3% |
| i | 43822 | 7.2% |
| t | 42600 | 7.0% |
| d | 30679 | 5.1% |
| r | 29153 | 4.8% |
| s | 28544 | 4.7% |
| l | 26300 | 4.4% |
| Other values (46) | 177334 |
Common
| Value | Count | Frequency (%) |
| 66029 | ||
| 1 | 1691 | 2.2% |
| 0 | 1677 | 2.2% |
| ! | 1123 | 1.4% |
| 2 | 1105 | 1.4% |
| ' | 982 | 1.3% |
| - | 824 | 1.1% |
| . | 712 | 0.9% |
| / | 538 | 0.7% |
| , | 435 | 0.6% |
| Other values (42) | 2782 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 682408 | |
| None | 32 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 66029 | 9.7% | |
| o | 65729 | 9.6% |
| n | 55657 | 8.2% |
| e | 54557 | 8.0% |
| a | 50167 | 7.4% |
| i | 43822 | 6.4% |
| t | 42600 | 6.2% |
| d | 30679 | 4.5% |
| r | 29153 | 4.3% |
| s | 28544 | 4.2% |
| Other values (84) | 215471 |
None
| Value | Count | Frequency (%) |
| | 4 | |
| | 4 | |
| | 4 | |
| î | 4 | |
| â | 4 | |
| Ã | 2 | |
| ¦ | 2 | |
| | 2 | |
| | 1 | 3.1% |
| ´ | 1 | 3.1% |
| Other values (4) | 4 |
zip_code
Text
| Distinct | 823 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 198585 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 55 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 146xx |
|---|---|
| 2nd row | 775xx |
| 3rd row | 481xx |
| 4th row | 088xx |
| 5th row | 441xx |
| Value | Count | Frequency (%) |
| 100xx | 597 | 1.5% |
| 945xx | 545 | 1.4% |
| 112xx | 516 | 1.3% |
| 606xx | 503 | 1.3% |
| 070xx | 473 | 1.2% |
| 900xx | 453 | 1.1% |
| 021xx | 397 | 1.0% |
| 300xx | 394 | 1.0% |
| 926xx | 371 | 0.9% |
| 750xx | 367 | 0.9% |
| Other values (813) | 35101 |
Most occurring characters
| Value | Count | Frequency (%) |
| x | 79434 | |
| 0 | 19773 | 10.0% |
| 1 | 15629 | 7.9% |
| 2 | 13589 | 6.8% |
| 9 | 12681 | 6.4% |
| 3 | 12356 | 6.2% |
| 7 | 10257 | 5.2% |
| 4 | 9121 | 4.6% |
| 5 | 9020 | 4.5% |
| 8 | 8670 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 119151 | |
| Lowercase Letter | 79434 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 19773 | |
| 1 | 15629 | |
| 2 | 13589 | |
| 9 | 12681 | |
| 3 | 12356 | |
| 7 | 10257 | |
| 4 | 9121 | |
| 5 | 9020 | |
| 8 | 8670 | |
| 6 | 8055 |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 79434 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 119151 | |
| Latin | 79434 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 19773 | |
| 1 | 15629 | |
| 2 | 13589 | |
| 9 | 12681 | |
| 3 | 12356 | |
| 7 | 10257 | |
| 4 | 9121 | |
| 5 | 9020 | |
| 8 | 8670 | |
| 6 | 8055 |
Latin
| Value | Count | Frequency (%) |
| x | 79434 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 198585 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| x | 79434 | |
| 0 | 19773 | 10.0% |
| 1 | 15629 | 7.9% |
| 2 | 13589 | 6.8% |
| 9 | 12681 | 6.4% |
| 3 | 12356 | 6.2% |
| 7 | 10257 | 5.2% |
| 4 | 9121 | 4.6% |
| 5 | 9020 | 4.5% |
| 8 | 8670 | 4.4% |
addr_state
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| CA | |
|---|---|
| NY | |
| FL | |
| TX | |
| NJ | 1850 |
| Other values (45) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 79434 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | TX |
| 3rd row | MI |
| 4th row | NJ |
| 5th row | OH |
Common Values
| Value | Count | Frequency (%) |
| CA | 7099 | |
| NY | 3812 | 9.6% |
| FL | 2866 | 7.2% |
| TX | 2727 | 6.9% |
| NJ | 1850 | 4.7% |
| IL | 1525 | 3.8% |
| PA | 1517 | 3.8% |
| VA | 1407 | 3.5% |
| GA | 1398 | 3.5% |
| MA | 1340 | 3.4% |
| Other values (40) | 14176 |
Length
| Value | Count | Frequency (%) |
| ca | 7099 | |
| ny | 3812 | 9.6% |
| fl | 2866 | 7.2% |
| tx | 2727 | 6.9% |
| nj | 1850 | 4.7% |
| il | 1525 | 3.8% |
| pa | 1517 | 3.8% |
| va | 1407 | 3.5% |
| ga | 1398 | 3.5% |
| ma | 1340 | 3.4% |
| Other values (40) | 14176 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 79434 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79434 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79434 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
dti
Real number (ℝ)
| Distinct | 2868 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.31513 |
| Minimum | 0 |
|---|---|
| Maximum | 29.99 |
| Zeros | 183 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.13 |
| Q1 | 8.17 |
| median | 13.4 |
| Q3 | 18.6 |
| 95-th percentile | 23.84 |
| Maximum | 29.99 |
| Range | 29.99 |
| Interquartile range (IQR) | 10.43 |
Descriptive statistics
| Standard deviation | 6.6785936 |
|---|---|
| Coefficient of variation (CV) | 0.50157932 |
| Kurtosis | -0.85201548 |
| Mean | 13.31513 |
| Median Absolute Deviation (MAD) | 5.21 |
| Skewness | -0.028043331 |
| Sum | 528837 |
| Variance | 44.603612 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 183 | 0.5% |
| 12 | 51 | 0.1% |
| 18 | 45 | 0.1% |
| 19.2 | 40 | 0.1% |
| 13.2 | 39 | 0.1% |
| 16.8 | 38 | 0.1% |
| 12.48 | 38 | 0.1% |
| 13.5 | 38 | 0.1% |
| 6 | 37 | 0.1% |
| 14.29 | 36 | 0.1% |
| Other values (2858) | 39172 |
| Value | Count | Frequency (%) |
| 0 | 183 | |
| 0.01 | 3 | < 0.1% |
| 0.02 | 5 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 3 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 5 | < 0.1% |
| 0.08 | 5 | < 0.1% |
| 0.09 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.99 | 1 | < 0.1% |
| 29.95 | 1 | < 0.1% |
| 29.93 | 3 | |
| 29.92 | 2 | |
| 29.89 | 1 | < 0.1% |
| 29.88 | 1 | < 0.1% |
| 29.86 | 2 | |
| 29.85 | 1 | < 0.1% |
| 29.83 | 1 | < 0.1% |
| 29.82 | 1 | < 0.1% |
delinq_2yrs
Real number (ℝ)
ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14651157 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 35405 |
| Zeros (%) | 89.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.49181152 |
|---|---|
| Coefficient of variation (CV) | 3.3568101 |
| Kurtosis | 39.4125 |
| Mean | 0.14651157 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.0220352 |
| Sum | 5819 |
| Variance | 0.24187857 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35405 | |
| 1 | 3303 | 8.3% |
| 2 | 687 | 1.7% |
| 3 | 220 | 0.6% |
| 4 | 62 | 0.2% |
| 5 | 22 | 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 35405 | |
| 1 | 3303 | 8.3% |
| 2 | 687 | 1.7% |
| 3 | 220 | 0.6% |
| 4 | 62 | 0.2% |
| 5 | 22 | 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 10 | < 0.1% |
| 5 | 22 | 0.1% |
| 4 | 62 | 0.2% |
| 3 | 220 | 0.6% |
| 2 | 687 | 1.7% |
| 1 | 3303 |
earliest_cr_line
Date
| Distinct | 526 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Minimum | 1946-01-01 00:00:00 |
|---|---|
| Maximum | 2008-01-11 00:00:00 |
inq_last_6mths
Real number (ℝ)
ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.86919959 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 19300 |
| Zeros (%) | 48.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0702193 |
|---|---|
| Coefficient of variation (CV) | 1.23127 |
| Kurtosis | 2.5621599 |
| Mean | 0.86919959 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3903909 |
| Sum | 34522 |
| Variance | 1.1453694 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 1 | 10971 | |
| 2 | 5812 | 14.6% |
| 3 | 3048 | 7.7% |
| 4 | 326 | 0.8% |
| 5 | 146 | 0.4% |
| 6 | 64 | 0.2% |
| 7 | 35 | 0.1% |
| 8 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 1 | 10971 | |
| 2 | 5812 | 14.6% |
| 3 | 3048 | 7.7% |
| 4 | 326 | 0.8% |
| 5 | 146 | 0.4% |
| 6 | 64 | 0.2% |
| 7 | 35 | 0.1% |
| 8 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 15 | < 0.1% |
| 7 | 35 | 0.1% |
| 6 | 64 | 0.2% |
| 5 | 146 | 0.4% |
| 4 | 326 | 0.8% |
| 3 | 3048 | 7.7% |
| 2 | 5812 | 14.6% |
| 1 | 10971 | |
| 0 | 19300 |
mths_since_last_delinq
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 25682 |
| Missing (%) | 64.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.900962 |
| Minimum | 0 |
|---|---|
| Maximum | 120 |
| Zeros | 443 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 18 |
| median | 34 |
| Q3 | 52 |
| 95-th percentile | 75 |
| Maximum | 120 |
| Range | 120 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 22.02006 |
|---|---|
| Coefficient of variation (CV) | 0.6133557 |
| Kurtosis | -0.84257778 |
| Mean | 35.900962 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.30643687 |
| Sum | 503870 |
| Variance | 484.88302 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 443 | 1.1% |
| 15 | 252 | 0.6% |
| 23 | 247 | 0.6% |
| 30 | 247 | 0.6% |
| 24 | 241 | 0.6% |
| 19 | 238 | 0.6% |
| 38 | 237 | 0.6% |
| 20 | 233 | 0.6% |
| 18 | 231 | 0.6% |
| 22 | 231 | 0.6% |
| Other values (85) | 11435 | |
| (Missing) | 25682 |
| Value | Count | Frequency (%) |
| 0 | 443 | |
| 1 | 30 | 0.1% |
| 2 | 101 | 0.3% |
| 3 | 145 | 0.4% |
| 4 | 153 | 0.4% |
| 5 | 151 | 0.4% |
| 6 | 192 | |
| 7 | 176 | 0.4% |
| 8 | 168 | 0.4% |
| 9 | 182 |
| Value | Count | Frequency (%) |
| 120 | 1 | |
| 115 | 1 | |
| 107 | 1 | |
| 106 | 1 | |
| 103 | 2 | |
| 97 | 1 | |
| 96 | 1 | |
| 95 | 1 | |
| 89 | 1 | |
| 86 | 2 |
mths_since_last_record
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 111 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 36931 |
| Missing (%) | 93.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.698134 |
| Minimum | 0 |
|---|---|
| Maximum | 129 |
| Zeros | 670 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 22 |
| median | 90 |
| Q3 | 104 |
| 95-th percentile | 115 |
| Maximum | 129 |
| Range | 129 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 43.822529 |
|---|---|
| Coefficient of variation (CV) | 0.62874753 |
| Kurtosis | -1.1565557 |
| Mean | 69.698134 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.71722858 |
| Sum | 194179 |
| Variance | 1920.4141 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 670 | 1.7% |
| 104 | 61 | 0.2% |
| 89 | 60 | 0.2% |
| 113 | 59 | 0.1% |
| 111 | 57 | 0.1% |
| 94 | 55 | 0.1% |
| 108 | 55 | 0.1% |
| 87 | 54 | 0.1% |
| 93 | 54 | 0.1% |
| 88 | 53 | 0.1% |
| Other values (101) | 1608 | 4.0% |
| (Missing) | 36931 |
| Value | Count | Frequency (%) |
| 0 | 670 | |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 17 | 3 | < 0.1% |
| 18 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 129 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 119 | 10 | < 0.1% |
| 118 | 36 | |
| 117 | 47 | |
| 116 | 41 | |
| 115 | 37 | |
| 114 | 51 | |
| 113 | 59 | |
| 112 | 39 |
open_acc
Real number (ℝ)
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.2944079 |
| Minimum | 2 |
|---|---|
| Maximum | 44 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 9 |
| Q3 | 12 |
| 95-th percentile | 17 |
| Maximum | 44 |
| Range | 42 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.4002825 |
|---|---|
| Coefficient of variation (CV) | 0.47343333 |
| Kurtosis | 1.677572 |
| Mean | 9.2944079 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.0037619 |
| Sum | 369146 |
| Variance | 19.362486 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 4018 | |
| 6 | 3946 | |
| 8 | 3936 | |
| 9 | 3718 | |
| 10 | 3223 | 8.1% |
| 5 | 3183 | 8.0% |
| 11 | 2746 | 6.9% |
| 4 | 2343 | 5.9% |
| 12 | 2273 | 5.7% |
| 13 | 1911 | 4.8% |
| Other values (30) | 8420 |
| Value | Count | Frequency (%) |
| 2 | 605 | 1.5% |
| 3 | 1493 | 3.8% |
| 4 | 2343 | |
| 5 | 3183 | |
| 6 | 3946 | |
| 7 | 4018 | |
| 8 | 3936 | |
| 9 | 3718 | |
| 10 | 3223 | |
| 11 | 2746 |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 2 | < 0.1% |
| 35 | 4 | |
| 34 | 5 | |
| 33 | 3 | |
| 32 | 4 |
pub_rec
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| 0 | |
|---|---|
| 1 | 2056 |
| 2 | 51 |
| 3 | 7 |
| 4 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39717 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39717 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39717 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39717 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
revol_bal
Real number (ℝ)
ZEROS 
| Distinct | 21711 |
|---|---|
| Distinct (%) | 54.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13382.528 |
| Minimum | 0 |
|---|---|
| Maximum | 149588 |
| Zeros | 994 |
| Zeros (%) | 2.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 321.8 |
| Q1 | 3703 |
| median | 8850 |
| Q3 | 17058 |
| 95-th percentile | 41656.4 |
| Maximum | 149588 |
| Range | 149588 |
| Interquartile range (IQR) | 13355 |
Descriptive statistics
| Standard deviation | 15885.017 |
|---|---|
| Coefficient of variation (CV) | 1.1869967 |
| Kurtosis | 14.896523 |
| Mean | 13382.528 |
| Median Absolute Deviation (MAD) | 6027 |
| Skewness | 3.1908837 |
| Sum | 5.3151387 × 108 |
| Variance | 2.5233375 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 994 | 2.5% |
| 255 | 14 | < 0.1% |
| 298 | 14 | < 0.1% |
| 1 | 12 | < 0.1% |
| 682 | 11 | < 0.1% |
| 1763 | 9 | < 0.1% |
| 10 | 9 | < 0.1% |
| 39 | 9 | < 0.1% |
| 6 | 9 | < 0.1% |
| 1159 | 9 | < 0.1% |
| Other values (21701) | 38627 |
| Value | Count | Frequency (%) |
| 0 | 994 | |
| 1 | 12 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 9 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 149588 | 1 | |
| 149527 | 1 | |
| 149000 | 1 | |
| 148829 | 1 | |
| 148804 | 1 | |
| 147897 | 1 | |
| 147750 | 1 | |
| 147559 | 1 | |
| 147451 | 1 | |
| 147365 | 1 |
revol_util
Text
| Distinct | 1089 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 50 |
| Missing (%) | 0.1% |
| Memory size | 310.4 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.8869085 |
| Min length | 5 |
Characters and Unicode
| Total characters | 233516 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 89 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 52.70% |
|---|---|
| 2nd row | 39.50% |
| 3rd row | 68.60% |
| 4th row | 88.40% |
| 5th row | 23.20% |
| Value | Count | Frequency (%) |
| 0.00 | 977 | 2.5% |
| 0.20 | 63 | 0.2% |
| 63.00 | 62 | 0.2% |
| 0.10 | 58 | 0.1% |
| 66.70 | 58 | 0.1% |
| 40.70 | 58 | 0.1% |
| 31.20 | 57 | 0.1% |
| 61.00 | 57 | 0.1% |
| 66.60 | 57 | 0.1% |
| 46.40 | 57 | 0.1% |
| Other values (1079) | 38163 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49323 | |
| . | 39667 | |
| % | 39667 | |
| 4 | 12082 | 5.2% |
| 5 | 12063 | 5.2% |
| 6 | 11989 | 5.1% |
| 7 | 11949 | 5.1% |
| 3 | 11885 | 5.1% |
| 2 | 11550 | 4.9% |
| 8 | 11419 | 4.9% |
| Other values (2) | 21922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 154182 | |
| Other Punctuation | 79334 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49323 | |
| 4 | 12082 | 7.8% |
| 5 | 12063 | 7.8% |
| 6 | 11989 | 7.8% |
| 7 | 11949 | 7.7% |
| 3 | 11885 | 7.7% |
| 2 | 11550 | 7.5% |
| 8 | 11419 | 7.4% |
| 1 | 11111 | 7.2% |
| 9 | 10811 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39667 | |
| % | 39667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 233516 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 49323 | |
| . | 39667 | |
| % | 39667 | |
| 4 | 12082 | 5.2% |
| 5 | 12063 | 5.2% |
| 6 | 11989 | 5.1% |
| 7 | 11949 | 5.1% |
| 3 | 11885 | 5.1% |
| 2 | 11550 | 4.9% |
| 8 | 11419 | 4.9% |
| Other values (2) | 21922 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 233516 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 49323 | |
| . | 39667 | |
| % | 39667 | |
| 4 | 12082 | 5.2% |
| 5 | 12063 | 5.2% |
| 6 | 11989 | 5.1% |
| 7 | 11949 | 5.1% |
| 3 | 11885 | 5.1% |
| 2 | 11550 | 4.9% |
| 8 | 11419 | 4.9% |
| Other values (2) | 21922 |
total_acc
Real number (ℝ)
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.088828 |
| Minimum | 2 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 13 |
| median | 20 |
| Q3 | 29 |
| 95-th percentile | 43 |
| Maximum | 90 |
| Range | 88 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.401709 |
|---|---|
| Coefficient of variation (CV) | 0.51617534 |
| Kurtosis | 0.6937402 |
| Mean | 22.088828 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.82737909 |
| Sum | 877302 |
| Variance | 129.99896 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 1471 | 3.7% |
| 15 | 1462 | 3.7% |
| 17 | 1457 | 3.7% |
| 14 | 1445 | 3.6% |
| 20 | 1428 | 3.6% |
| 18 | 1422 | 3.6% |
| 21 | 1412 | 3.6% |
| 13 | 1385 | 3.5% |
| 19 | 1341 | 3.4% |
| 12 | 1325 | 3.3% |
| Other values (72) | 25569 |
| Value | Count | Frequency (%) |
| 2 | 4 | < 0.1% |
| 3 | 182 | 0.5% |
| 4 | 420 | 1.1% |
| 5 | 552 | |
| 6 | 683 | |
| 7 | 828 | |
| 8 | 1006 | |
| 9 | 1080 | |
| 10 | 1193 | |
| 11 | 1278 |
| Value | Count | Frequency (%) |
| 90 | 1 | |
| 87 | 1 | |
| 81 | 1 | |
| 80 | 1 | |
| 79 | 2 | |
| 78 | 1 | |
| 77 | 1 | |
| 76 | 2 | |
| 75 | 2 | |
| 74 | 1 |
initial_list_status
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.9 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 39717 |
out_prncp
Real number (ℝ)
ZEROS 
| Distinct | 1137 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.227887 |
| Minimum | 0 |
|---|---|
| Maximum | 6311.47 |
| Zeros | 38577 |
| Zeros (%) | 97.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6311.47 |
| Range | 6311.47 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 375.17284 |
|---|---|
| Coefficient of variation (CV) | 7.3236055 |
| Kurtosis | 97.658555 |
| Mean | 51.227887 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.22673 |
| Sum | 2034618 |
| Variance | 140754.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 2963.24 | 2 | < 0.1% |
| 827.13 | 2 | < 0.1% |
| 2277.11 | 2 | < 0.1% |
| 1972.6 | 2 | < 0.1% |
| 1347.43 | 1 | < 0.1% |
| 2540.31 | 1 | < 0.1% |
| 1978.94 | 1 | < 0.1% |
| 1231.2 | 1 | < 0.1% |
| 2614.43 | 1 | < 0.1% |
| Other values (1127) | 1127 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 10.26 | 1 | < 0.1% |
| 11.91 | 1 | < 0.1% |
| 13.28 | 1 | < 0.1% |
| 19.12 | 1 | < 0.1% |
| 27.41 | 1 | < 0.1% |
| 40.65 | 1 | < 0.1% |
| 50.46 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 57.67 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6311.47 | 1 | |
| 6308.37 | 1 | |
| 6307.37 | 1 | |
| 6307.15 | 1 | |
| 6219.16 | 1 | |
| 6219.11 | 1 | |
| 6182.86 | 1 | |
| 6071.68 | 1 | |
| 6034.37 | 1 | |
| 6027.7 | 1 |
out_prncp_inv
Real number (ℝ)
ZEROS 
| Distinct | 1138 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.989768 |
| Minimum | 0 |
|---|---|
| Maximum | 6307.37 |
| Zeros | 38577 |
| Zeros (%) | 97.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6307.37 |
| Range | 6307.37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 373.82446 |
|---|---|
| Coefficient of variation (CV) | 7.3313622 |
| Kurtosis | 98.040553 |
| Mean | 50.989768 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.2437655 |
| Sum | 2025160.6 |
| Variance | 139744.72 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 827.13 | 2 | < 0.1% |
| 1972.6 | 2 | < 0.1% |
| 1664.64 | 2 | < 0.1% |
| 1228.61 | 1 | < 0.1% |
| 2614.43 | 1 | < 0.1% |
| 1323.6 | 1 | < 0.1% |
| 1049.49 | 1 | < 0.1% |
| 1232.19 | 1 | < 0.1% |
| 1243.6 | 1 | < 0.1% |
| Other values (1128) | 1128 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 10.26 | 1 | < 0.1% |
| 11.91 | 1 | < 0.1% |
| 13.28 | 1 | < 0.1% |
| 19.09 | 1 | < 0.1% |
| 27.41 | 1 | < 0.1% |
| 40.65 | 1 | < 0.1% |
| 50.46 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 57.67 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6307.37 | 1 | |
| 6306.96 | 1 | |
| 6298.11 | 1 | |
| 6276.75 | 1 | |
| 6219.16 | 1 | |
| 6183.55 | 1 | |
| 6182.86 | 1 | |
| 6067.33 | 1 | |
| 6034.37 | 1 | |
| 6027.7 | 1 |
total_pymnt
Real number (ℝ)
| Distinct | 36591 |
|---|---|
| Distinct (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12153.597 |
| Minimum | 0 |
|---|---|
| Maximum | 58563.68 |
| Zeros | 16 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1887.954 |
| Q1 | 5576.93 |
| median | 9899.64 |
| Q3 | 16534.43 |
| 95-th percentile | 30245.116 |
| Maximum | 58563.68 |
| Range | 58563.68 |
| Interquartile range (IQR) | 10957.5 |
Descriptive statistics
| Standard deviation | 9042.0408 |
|---|---|
| Coefficient of variation (CV) | 0.74398066 |
| Kurtosis | 1.9858943 |
| Mean | 12153.597 |
| Median Absolute Deviation (MAD) | 5016.76 |
| Skewness | 1.3398574 |
| Sum | 4.8270439 × 108 |
| Variance | 81758501 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11196.57 | 26 | 0.1% |
| 6514.52 | 19 | < 0.1% |
| 10956.78 | 17 | < 0.1% |
| 13148.14 | 17 | < 0.1% |
| 0 | 16 | < 0.1% |
| 6717.95 | 16 | < 0.1% |
| 11784.23 | 16 | < 0.1% |
| 5478.39 | 15 | < 0.1% |
| 11907.35 | 14 | < 0.1% |
| 13517.36 | 13 | < 0.1% |
| Other values (36581) | 39548 |
| Value | Count | Frequency (%) |
| 0 | 16 | |
| 33.73 | 1 | < 0.1% |
| 35.71 | 1 | < 0.1% |
| 44.92 | 2 | < 0.1% |
| 44.96 | 1 | < 0.1% |
| 61.71 | 1 | < 0.1% |
| 62.86 | 1 | < 0.1% |
| 66.77 | 1 | < 0.1% |
| 67.32 | 1 | < 0.1% |
| 69.64 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58563.68 | 1 | |
| 58480.14 | 1 | |
| 57835.28 | 1 | |
| 56849.27 | 1 | |
| 56662.59 | 1 | |
| 56199.44 | 1 | |
| 55906.95 | 1 | |
| 55768.78 | 1 | |
| 55368.41 | 1 | |
| 55139 | 1 |
total_pymnt_inv
Real number (ℝ)
| Distinct | 37518 |
|---|---|
| Distinct (%) | 94.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11567.149 |
| Minimum | 0 |
|---|---|
| Maximum | 58563.68 |
| Zeros | 165 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1420.408 |
| Q1 | 5112.31 |
| median | 9287.15 |
| Q3 | 15798.81 |
| 95-th percentile | 29627.236 |
| Maximum | 58563.68 |
| Range | 58563.68 |
| Interquartile range (IQR) | 10686.5 |
Descriptive statistics
| Standard deviation | 8942.6726 |
|---|---|
| Coefficient of variation (CV) | 0.77310948 |
| Kurtosis | 2.0297585 |
| Mean | 11567.149 |
| Median Absolute Deviation (MAD) | 4939.58 |
| Skewness | 1.3548376 |
| Sum | 4.5941246 × 108 |
| Variance | 79971393 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 165 | 0.4% |
| 6514.52 | 16 | < 0.1% |
| 5478.39 | 14 | < 0.1% |
| 13148.14 | 14 | < 0.1% |
| 10956.78 | 12 | < 0.1% |
| 6717.95 | 12 | < 0.1% |
| 11196.57 | 12 | < 0.1% |
| 7328.92 | 11 | < 0.1% |
| 13517.36 | 11 | < 0.1% |
| 5557.03 | 11 | < 0.1% |
| Other values (37508) | 39439 |
| Value | Count | Frequency (%) |
| 0 | 165 | |
| 0.54 | 1 | < 0.1% |
| 12.65 | 1 | < 0.1% |
| 18.97 | 1 | < 0.1% |
| 21.6 | 1 | < 0.1% |
| 25.18 | 1 | < 0.1% |
| 26.19 | 1 | < 0.1% |
| 33.73 | 1 | < 0.1% |
| 33.99 | 1 | < 0.1% |
| 35.71 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58563.68 | 1 | |
| 58438.37 | 1 | |
| 57628.73 | 1 | |
| 56622.12 | 1 | |
| 56515.16 | 1 | |
| 55867.02 | 1 | |
| 55579.28 | 1 | |
| 55066.92 | 1 | |
| 54675.68 | 1 | |
| 54315.94 | 1 |
total_rec_prncp
Real number (ℝ)
| Distinct | 7976 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9793.3488 |
| Minimum | 0 |
|---|---|
| Maximum | 35000.02 |
| Zeros | 74 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1339.842 |
| Q1 | 4600 |
| median | 8000 |
| Q3 | 13653.26 |
| 95-th percentile | 24999.982 |
| Maximum | 35000.02 |
| Range | 35000.02 |
| Interquartile range (IQR) | 9053.26 |
Descriptive statistics
| Standard deviation | 7065.5221 |
|---|---|
| Coefficient of variation (CV) | 0.7214613 |
| Kurtosis | 1.1033555 |
| Mean | 9793.3488 |
| Median Absolute Deviation (MAD) | 4000 |
| Skewness | 1.1182545 |
| Sum | 3.8896243 × 108 |
| Variance | 49921603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2293 | 5.8% |
| 12000 | 1805 | 4.5% |
| 5000 | 1702 | 4.3% |
| 6000 | 1637 | 4.1% |
| 15000 | 1400 | 3.5% |
| 8000 | 1318 | 3.3% |
| 20000 | 1059 | 2.7% |
| 4000 | 956 | 2.4% |
| 3000 | 883 | 2.2% |
| 7000 | 851 | 2.1% |
| Other values (7966) | 25813 |
| Value | Count | Frequency (%) |
| 0 | 74 | |
| 21.21 | 1 | < 0.1% |
| 21.93 | 1 | < 0.1% |
| 22.24 | 1 | < 0.1% |
| 22.5 | 1 | < 0.1% |
| 24.87 | 1 | < 0.1% |
| 30.32 | 1 | < 0.1% |
| 32.51 | 1 | < 0.1% |
| 34.5 | 1 | < 0.1% |
| 35.14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000.02 | 2 | < 0.1% |
| 35000.01 | 1 | < 0.1% |
| 35000 | 363 | |
| 34999.99 | 5 | < 0.1% |
| 34999.98 | 1 | < 0.1% |
| 34999.97 | 1 | < 0.1% |
| 34911.47 | 1 | < 0.1% |
| 34800 | 1 | < 0.1% |
| 34793.43 | 1 | < 0.1% |
| 34675 | 1 | < 0.1% |
total_rec_int
Real number (ℝ)
| Distinct | 35148 |
|---|---|
| Distinct (%) | 88.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2263.6632 |
| Minimum | 0 |
|---|---|
| Maximum | 23563.68 |
| Zeros | 71 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 186.168 |
| Q1 | 662.18 |
| median | 1348.91 |
| Q3 | 2833.4 |
| 95-th percentile | 7575.812 |
| Maximum | 23563.68 |
| Range | 23563.68 |
| Interquartile range (IQR) | 2171.22 |
Descriptive statistics
| Standard deviation | 2608.112 |
|---|---|
| Coefficient of variation (CV) | 1.1521643 |
| Kurtosis | 9.6882784 |
| Mean | 2263.6632 |
| Median Absolute Deviation (MAD) | 866.01 |
| Skewness | 2.6687472 |
| Sum | 89905910 |
| Variance | 6802248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 71 | 0.2% |
| 1196.57 | 26 | 0.1% |
| 514.52 | 19 | < 0.1% |
| 1784.23 | 17 | < 0.1% |
| 717.95 | 17 | < 0.1% |
| 1148.14 | 17 | < 0.1% |
| 956.78 | 17 | < 0.1% |
| 478.39 | 16 | < 0.1% |
| 1907.35 | 14 | < 0.1% |
| 1435.9 | 13 | < 0.1% |
| Other values (35138) | 39490 |
| Value | Count | Frequency (%) |
| 0 | 71 | |
| 6.22 | 1 | < 0.1% |
| 6.27 | 1 | < 0.1% |
| 7.19 | 1 | < 0.1% |
| 7.2 | 2 | < 0.1% |
| 8.23 | 1 | < 0.1% |
| 9.34 | 1 | < 0.1% |
| 9.49 | 1 | < 0.1% |
| 9.58 | 2 | < 0.1% |
| 10.26 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 23563.68 | 1 | |
| 23506.56 | 1 | |
| 23480.14 | 1 | |
| 22835.28 | 1 | |
| 22716.42 | 1 | |
| 22594.16 | 1 | |
| 22593.34 | 1 | |
| 22593.04 | 1 | |
| 22587.51 | 1 | |
| 22422.33 | 1 |
total_rec_late_fee
Real number (ℝ)
ZEROS 
| Distinct | 801 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3630194 |
| Minimum | 0 |
|---|---|
| Maximum | 180.2 |
| Zeros | 37671 |
| Zeros (%) | 94.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 14.92 |
| Maximum | 180.2 |
| Range | 180.2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.2899931 |
|---|---|
| Coefficient of variation (CV) | 5.3484149 |
| Kurtosis | 100.85133 |
| Mean | 1.3630194 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.4295237 |
| Sum | 54135.04 |
| Variance | 53.143999 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37671 | |
| 15 | 604 | 1.5% |
| 30 | 123 | 0.3% |
| 14.98 | 68 | 0.2% |
| 14.99 | 53 | 0.1% |
| 14.97 | 42 | 0.1% |
| 45 | 38 | 0.1% |
| 14.96 | 33 | 0.1% |
| 14.94 | 27 | 0.1% |
| 14.95 | 24 | 0.1% |
| Other values (791) | 1034 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 37671 | |
| 0.01 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.18 | 2 | < 0.1% |
| 0.27 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.65 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 180.2 | 1 | |
| 166.43 | 1 | |
| 165.69 | 1 | |
| 146.6 | 1 | |
| 146.04 | 1 | |
| 134.07 | 1 | |
| 130.6 | 1 | |
| 130.47 | 1 | |
| 127.79 | 1 | |
| 121.93 | 1 |
recoveries
Real number (ℝ)
ZEROS 
| Distinct | 4040 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.221624 |
| Minimum | 0 |
|---|---|
| Maximum | 29623.35 |
| Zeros | 35499 |
| Zeros (%) | 89.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 362.418 |
| Maximum | 29623.35 |
| Range | 29623.35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 688.74477 |
|---|---|
| Coefficient of variation (CV) | 7.233071 |
| Kurtosis | 379.37757 |
| Mean | 95.221624 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.519378 |
| Sum | 3781917.2 |
| Variance | 474369.36 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35499 | |
| 10.4 | 4 | < 0.1% |
| 11.29 | 4 | < 0.1% |
| 11.2 | 3 | < 0.1% |
| 12.09 | 3 | < 0.1% |
| 44.92 | 3 | < 0.1% |
| 10.13 | 3 | < 0.1% |
| 14.61 | 3 | < 0.1% |
| 13 | 3 | < 0.1% |
| 13.93 | 3 | < 0.1% |
| Other values (4030) | 4189 | 10.5% |
| Value | Count | Frequency (%) |
| 0 | 35499 | |
| 6.3 | 1 | < 0.1% |
| 6.31 | 1 | < 0.1% |
| 8.19 | 1 | < 0.1% |
| 8.36 | 1 | < 0.1% |
| 8.41 | 1 | < 0.1% |
| 8.46 | 1 | < 0.1% |
| 8.56 | 1 | < 0.1% |
| 8.71 | 1 | < 0.1% |
| 8.88 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 29623.35 | 1 | |
| 22943.37 | 1 | |
| 21810.31 | 1 | |
| 20006.53 | 1 | |
| 19915.67 | 1 | |
| 19508.26 | 1 | |
| 18694.32 | 1 | |
| 16560.06 | 1 | |
| 16502.69 | 1 | |
| 16268.35 | 1 |
collection_recovery_fee
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 2118 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.406114 |
| Minimum | 0 |
|---|---|
| Maximum | 7002.19 |
| Zeros | 35935 |
| Zeros (%) | 90.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5.152 |
| Maximum | 7002.19 |
| Range | 7002.19 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 148.6716 |
|---|---|
| Coefficient of variation (CV) | 11.983736 |
| Kurtosis | 821.30052 |
| Mean | 12.406114 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.029416 |
| Sum | 492733.63 |
| Variance | 22103.245 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35935 | |
| 2 | 12 | < 0.1% |
| 1.2 | 12 | < 0.1% |
| 0.8 | 11 | < 0.1% |
| 1.69 | 10 | < 0.1% |
| 3.23 | 10 | < 0.1% |
| 2.08 | 10 | < 0.1% |
| 3.71 | 9 | < 0.1% |
| 3.2 | 9 | < 0.1% |
| 1.6 | 9 | < 0.1% |
| Other values (2108) | 3690 | 9.3% |
| Value | Count | Frequency (%) |
| 0 | 35935 | |
| 0.06 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.2 | 3 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.22 | 1 | < 0.1% |
| 0.23 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 7002.19 | 1 | |
| 6972.59 | 1 | |
| 6543.04 | 1 | |
| 5774.8 | 1 | |
| 5602.72 | 1 | |
| 5569.92 | 1 | |
| 5216.74 | 1 | |
| 5036.01 | 1 | |
| 4902.08 | 1 | |
| 4900.75 | 1 |
last_pymnt_d
Date
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 71 |
| Missing (%) | 0.2% |
| Memory size | 310.4 KiB |
| Minimum | 2008-01-01 00:00:00 |
|---|---|
| Maximum | 2016-01-05 00:00:00 |
last_pymnt_amnt
Real number (ℝ)
| Distinct | 34930 |
|---|---|
| Distinct (%) | 87.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2678.8262 |
| Minimum | 0 |
|---|---|
| Maximum | 36115.2 |
| Zeros | 74 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 43.34 |
| Q1 | 218.68 |
| median | 546.14 |
| Q3 | 3293.16 |
| 95-th percentile | 12183.944 |
| Maximum | 36115.2 |
| Range | 36115.2 |
| Interquartile range (IQR) | 3074.48 |
Descriptive statistics
| Standard deviation | 4447.136 |
|---|---|
| Coefficient of variation (CV) | 1.6601062 |
| Kurtosis | 8.8678197 |
| Mean | 2678.8262 |
| Median Absolute Deviation (MAD) | 449.45 |
| Skewness | 2.7121222 |
| Sum | 1.0639494 × 108 |
| Variance | 19777019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 74 | 0.2% |
| 276.06 | 21 | 0.1% |
| 200 | 17 | < 0.1% |
| 50 | 16 | < 0.1% |
| 100 | 15 | < 0.1% |
| 400 | 12 | < 0.1% |
| 773.44 | 12 | < 0.1% |
| 786.01 | 11 | < 0.1% |
| 500 | 11 | < 0.1% |
| 150 | 11 | < 0.1% |
| Other values (34920) | 39517 |
| Value | Count | Frequency (%) |
| 0 | 74 | |
| 0.01 | 1 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.24 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.28 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 36115.2 | 1 | |
| 35613.68 | 1 | |
| 35596.41 | 1 | |
| 35479.89 | 1 | |
| 35471.86 | 1 | |
| 35395.59 | 1 | |
| 35339.05 | 1 | |
| 35337.09 | 1 | |
| 35322.96 | 1 | |
| 35322.6 | 1 |
next_pymnt_d
Date
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 38577 |
| Missing (%) | 97.1% |
| Memory size | 310.4 KiB |
| Minimum | 2016-01-06 00:00:00 |
|---|---|
| Maximum | 2016-01-07 00:00:00 |
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 310.4 KiB |
| Minimum | 2007-01-05 00:00:00 |
|---|---|
| Maximum | 2016-01-05 00:00:00 |
application_type
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| INDIVIDUAL |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 397170 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | INDIVIDUAL |
|---|---|
| 2nd row | INDIVIDUAL |
| 3rd row | INDIVIDUAL |
| 4th row | INDIVIDUAL |
| 5th row | INDIVIDUAL |
Common Values
| Value | Count | Frequency (%) |
| INDIVIDUAL | 39717 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| individual | 39717 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 119151 | |
| D | 79434 | |
| N | 39717 | 10.0% |
| V | 39717 | 10.0% |
| U | 39717 | 10.0% |
| A | 39717 | 10.0% |
| L | 39717 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 397170 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 119151 | |
| D | 79434 | |
| N | 39717 | 10.0% |
| V | 39717 | 10.0% |
| U | 39717 | 10.0% |
| A | 39717 | 10.0% |
| L | 39717 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 397170 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 119151 | |
| D | 79434 | |
| N | 39717 | 10.0% |
| V | 39717 | 10.0% |
| U | 39717 | 10.0% |
| A | 39717 | 10.0% |
| L | 39717 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 397170 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 119151 | |
| D | 79434 | |
| N | 39717 | 10.0% |
| V | 39717 | 10.0% |
| U | 39717 | 10.0% |
| A | 39717 | 10.0% |
| L | 39717 | 10.0% |
pub_rec_bankruptcies
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 697 |
| Missing (%) | 1.8% |
| Memory size | 310.4 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1674 |
| 2.0 | 7 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 117060 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 37339 | |
| 1.0 | 1674 | 4.2% |
| 2.0 | 7 | < 0.1% |
| (Missing) | 697 | 1.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 37339 | |
| 1.0 | 1674 | 4.3% |
| 2.0 | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 78040 | |
| Other Punctuation | 39020 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| 1 | 1674 | 2.1% |
| 2 | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39020 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 117060 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
| id | member_id | loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | pymnt_plan | url | desc | purpose | title | zip_code | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | mths_since_last_delinq | mths_since_last_record | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | out_prncp | out_prncp_inv | total_pymnt | total_pymnt_inv | total_rec_prncp | total_rec_int | total_rec_late_fee | recoveries | collection_recovery_fee | last_pymnt_d | last_pymnt_amnt | next_pymnt_d | last_credit_pull_d | application_type | pub_rec_bankruptcies | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 69001 | 265533 | 15000.0 | 15000.0 | 14875.00 | 36 months | 8.94% | 476.58 | A | A5 | NaN | < 1 year | MORTGAGE | 110000.0 | Not Verified | 01-09-2009 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=69001 | Taking advantage of excellent credit to pay off credit card | credit_card | Revolving Debt | 146xx | NY | 7.07 | 0.0 | 01-11-1991 | 1 | 0.0 | 0.0 | 6 | 0 | 7586.0 | 52.70% | 19 | f | 0.0 | 0.0 | 17135.51 | 16992.71 | 15000.00 | 2135.51 | 0.00 | 0.0 | 0.0 | 01-07-2012 | 1919.13 | NaN | 01-08-2015 | INDIVIDUAL | NaN |
| 1 | 59006 | 154254 | 3000.0 | 3000.0 | 2988.24 | 36 months | 14.26% | 102.92 | C | C5 | NaN | 3 years | MORTGAGE | 80800.0 | Not Verified | 01-09-2009 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=59006 | I am seeking to refinance a credit account which I closed with a balance when I rejected the new terms of the cardmember agreement. This closed account is adversely affecting my credit utilization percentage and I would prefer to move it to a fixed-rate loan. I am a software developer who has been in a stable position with the same company since 2004. I am up-to-date on all payments and am seeking only to reduce the interest rate of this debt. Thank you for your consideration. | credit_card | Rejecting new cardmember agreement | 775xx | TX | 14.97 | 1.0 | 01-07-1998 | 0 | 13.0 | 0.0 | 13 | 0 | 4740.0 | 39.50% | 23 | f | 0.0 | 0.0 | 3705.00 | 3688.85 | 3000.00 | 705.00 | 0.00 | 0.0 | 0.0 | 01-10-2012 | 111.23 | NaN | 01-09-2012 | INDIVIDUAL | NaN |
| 2 | 65426 | 232106 | 4000.0 | 4000.0 | 3892.26 | 36 months | 11.14% | 131.22 | B | B1 | Infotrieve, Inc. | < 1 year | MORTGAGE | 60000.0 | Not Verified | 01-08-2009 | Charged Off | n | https://lendingclub.com/browse/loanDetail.action?loan_id=65426 | We currently have one car that is 19 years old and one that is 8 years old. The 19 year old car, which is the car my husband drives to his job at a local university, was just given about a month to live by our mechanic. We've gotten an amazing amount of use out of it but we will need to be sure to get a reliable vehicle before that one gives out. We hope to be able to donate it with some life left in it to a local non-profit. That is what we have done in the past with our old cars. Our mechanic will help us find a used car in great shape for around $10,000. We have saved about half of that but we really need to make a purchase soon. It would be fabulous to get a loan for a lower percentage rate than what our credit union offers. Currently that is probably about 11% for older vehicles. Thanks for considering us. | car | djp | 481xx | MI | 11.08 | 0.0 | 01-08-1995 | 0 | 0.0 | 0.0 | 14 | 0 | 24220.0 | 68.60% | 33 | f | 0.0 | 0.0 | 2755.20 | 2615.80 | 2170.35 | 584.85 | 0.00 | 0.0 | 0.0 | 01-06-2011 | 131.22 | NaN | 01-05-2016 | INDIVIDUAL | NaN |
| 3 | 68926 | 264924 | 2300.0 | 2300.0 | 589.61 | 36 months | 13.17% | 77.69 | D | D2 | UBS | 10+ years | RENT | 37152.0 | Verified | 01-08-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68926 | I need a loan to cover moving expenses such as buying new furniture, deposit on the apt etc. | moving | tee_cee | 088xx | NJ | 2.26 | 0.0 | 01-12-1997 | 0 | 46.0 | 0.0 | 4 | 0 | 2211.0 | 88.40% | 13 | f | 0.0 | 0.0 | 2796.60 | 643.50 | 2300.00 | 496.60 | 0.00 | 0.0 | 0.0 | 01-09-2011 | 77.78 | NaN | 01-05-2016 | INDIVIDUAL | NaN |
| 4 | 69251 | 267771 | 6000.0 | 6000.0 | 500.00 | 36 months | 8.00% | 188.02 | A | A3 | NaN | < 1 year | MORTGAGE | 75000.0 | Not Verified | 01-05-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=69251 | Looking to pay bills with a lower rate and try a new type of lending. Please note my perfect credit history and ability to pay the account. Many Thanks Heather | other | NewOrganic | 441xx | OH | 16.08 | 0.0 | 01-12-1994 | 1 | 0.0 | 0.0 | 16 | 0 | 29797.0 | 23.20% | 39 | f | 0.0 | 0.0 | 6783.75 | 565.31 | 5999.99 | 768.76 | 15.00 | 0.0 | 0.0 | 01-05-2011 | 189.36 | NaN | 01-05-2011 | INDIVIDUAL | NaN |
| 5 | 65640 | 234569 | 5000.0 | 2650.0 | 495.49 | 36 months | 11.34% | 87.19 | C | C2 | kmex/univision | 10+ years | MORTGAGE | 90000.0 | Not Verified | 01-05-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=65640 | This money would be used to finish a remodeling kitchen project. | home_improvement | producer46 | 912xx | CA | 17.25 | 0.0 | 01-05-1997 | 1 | 0.0 | 0.0 | 20 | 0 | 69909.0 | 51.10% | 51 | f | 0.0 | 0.0 | 3153.80 | 512.18 | 2649.99 | 488.82 | 15.00 | 0.0 | 0.0 | 01-05-2011 | 87.83 | NaN | 01-04-2015 | INDIVIDUAL | NaN |
| 6 | 69924 | 274280 | 10000.0 | 10000.0 | 8790.33 | 36 months | 13.55% | 339.60 | D | D4 | GAP | 3 years | RENT | 100000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=69924 | Hello friends, I am trying to pay off couple of credit cards which has raised the APR recently, increasing my monthly payments Thanks Sunny | credit_card | Trying to pay off high interest cards | 941xx | CA | 7.94 | 0.0 | 01-11-2002 | 0 | 0.0 | 0.0 | 11 | 0 | 21162.0 | 57.70% | 14 | f | 0.0 | 0.0 | 12225.41 | 10738.03 | 10000.00 | 2225.41 | 0.00 | 0.0 | 0.0 | 01-04-2011 | 359.55 | NaN | 01-02-2016 | INDIVIDUAL | NaN |
| 7 | 69828 | 272798 | 15000.0 | 15000.0 | 13138.20 | 36 months | 8.63% | 474.42 | A | A5 | State of Michigan | 10+ years | OWN | 50000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=69828 | We have owned and operated a year round greenhouse/nursery, organic food processing operation in the the central Michigan area since 1990. Last season we had the opportunity to purchase 12,000 square foot of 4 year old greenhouse space, and we are seeking "investors" who would be willing to help us with reconstruction costs, so we will have the added greenhouse space for organic, sustainable, local food production. We also intend to make equipment upgrades to our commercial processing kitchen. | other | Business Expansion | 488xx | MI | 2.59 | 0.0 | 01-06-1975 | 0 | 0.0 | 0.0 | 4 | 0 | 5656.0 | 27.60% | 25 | f | 0.0 | 0.0 | 17208.18 | 15033.50 | 15000.00 | 2113.30 | 94.88 | 0.0 | 0.0 | 01-08-2011 | 38.20 | NaN | 01-08-2011 | INDIVIDUAL | NaN |
| 8 | 282707 | 282641 | 10000.0 | 10000.0 | 8741.04 | 36 months | 9.45% | 320.10 | B | B1 | Brightstar Corporation | 2 years | RENT | 70000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=282707 | I'm a successful 25 year old sales rep. This loan is for my education, but not books or a common education, more like a street smart education and experience. I'm practicing how to raise capital/borrow money to create profitable returns. This is a skill I want to master to create wealth. I'm also enjoying the experience as I don't "need" the money. I don't have a mortgage, my income exceeds my expenses and I'm very meticulous about spending. I have good control over myself mentally, spiritually and emotionally, therefore I have good control over my money. My only debt is a car lease and a interest free care credit loan for my lasik (i could have paid cash but why not get interest free financing). My credit card balances are paid in full during their billing cycle - they are used solely to build credit and get points. I don't have school loans as I did my undergrad as a Fulbright Scholar, and MBA's are overrated. So I'm pretty liquid. Even my wife's diamond ring is paid in full, and her parents paid for the wedding. So why borrow if I have money? Again, this is for the experience, I can comfortably afford the loan and interest, but I think this experience will make me better in business. There is bad debt, when you buy liabilities or things that depreciate. This is good debt, the kind that puts money in my pocket building an asset. Thanks for your time, Carlos | educational | Social Entrepreneur | 600xx | IL | 9.38 | 0.0 | 01-02-2003 | 0 | 0.0 | 0.0 | 8 | 0 | 8850.0 | 32.30% | 8 | f | 0.0 | 0.0 | 11466.39 | 10004.77 | 10000.00 | 1466.40 | 0.00 | 0.0 | 0.0 | 01-02-2011 | 306.86 | NaN | 01-02-2011 | INDIVIDUAL | NaN |
| 9 | 282569 | 274158 | 12000.0 | 12000.0 | 6475.00 | 36 months | 13.55% | 407.52 | D | D4 | Wachovia Bank | < 1 year | MORTGAGE | 55000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=282569 | A year ago I purchased my dream home and with that purchase, credit card debt slowly built up. Unfortunately the rates on these cards are through the roof. My plan was to take a year and agressively pay off the balances but rates are so high. When compounding interest enters into the equation I end up only paying off a small percentage of the principal. For the past year I have been paying $400-500 a month towards credit debt and need a solution to compounding interest. The idea of having a fixed rate is exactly what I am looking. Yes, the rates may be higher then traditional lenders but I like the concept of this new type of lending because there is a little more forgiveness towards the borrower. In fact I am a Financial Center Manager with Wachovia Bank but I have run into DTI problems on there grading scale for a loan. | credit_card | Refinance credit card debt | 191xx | PA | 14.99 | 0.0 | 01-11-1995 | 2 | 61.0 | 0.0 | 12 | 0 | 10918.0 | 59.00% | 45 | f | 0.0 | 0.0 | 13861.23 | 7479.29 | 12000.00 | 1861.23 | 0.00 | 0.0 | 0.0 | 01-08-2009 | 7355.37 | NaN | 01-09-2009 | INDIVIDUAL | NaN |
| id | member_id | loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | pymnt_plan | url | desc | purpose | title | zip_code | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | mths_since_last_delinq | mths_since_last_record | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | out_prncp | out_prncp_inv | total_pymnt | total_pymnt_inv | total_rec_prncp | total_rec_int | total_rec_late_fee | recoveries | collection_recovery_fee | last_pymnt_d | last_pymnt_amnt | next_pymnt_d | last_credit_pull_d | application_type | pub_rec_bankruptcies | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39707 | 286120 | 284863 | 12000.0 | 12000.0 | 9542.16 | 36 months | 13.55% | 407.52 | D | D4 | Bank | 2 years | MORTGAGE | 130000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=286120 | These funds will be used to buy an equity position in 40 acres of pine trees. The timber has been valued at $58,000. I will be a 50% partner in the tree farm. I will be putting $8,000 of personal cash towards the deal. This will be the second deal of similar nature for me. I have a great job in the financial industry and am a good candidate for this loan. Due to the volatility in real estate and the stock market, commodities such as timber are solid investments. | other | Timber Investment | 392xx | MS | 8.32 | 0.0 | 01-10-1999 | 0 | 24.0 | NaN | 9 | 0 | 22044.0 | 81.60% | 21 | f | 0.0 | 0.0 | 14689.66 | 11666.64 | 12000.00 | 2670.67 | 18.99 | 0.0 | 0.0 | 01-03-2011 | 434.37 | NaN | 01-03-2011 | INDIVIDUAL | 0.0 |
| 39708 | 285781 | 285778 | 14000.0 | 14000.0 | 10100.00 | 36 months | 9.45% | 448.14 | B | B1 | Citibank | 4 years | RENT | 42000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=285781 | I have been trying to pay off a couple of credit card debts and have consolidated them into two loans, but with 19.99% interest rates I feel most of the payments are going to interest. I would prefer a lower interest rate and be done with it in 3 years or less. | debt_consolidation | Finally Over | 107xx | NY | 5.31 | 0.0 | 01-11-1999 | 0 | 51.0 | NaN | 12 | 0 | 11556.0 | 24.30% | 18 | f | 0.0 | 0.0 | 14715.67 | 10616.43 | 14000.00 | 715.67 | 0.00 | 0.0 | 0.0 | 01-10-2008 | 12028.28 | NaN | 01-10-2008 | INDIVIDUAL | 0.0 |
| 39709 | 285738 | 285732 | 4000.0 | 4000.0 | 3000.00 | 36 months | 10.39% | 129.81 | B | B4 | Wachovia Corp. | < 1 year | MORTGAGE | 53000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=285738 | I need to pay high interest capital one card so that it would make sense to have a higher payment to pay it off sooner. | credit_card | Interest rate too high on credit card | 917xx | CA | 13.09 | 0.0 | 01-01-1996 | 1 | 44.0 | NaN | 11 | 0 | 1308.0 | 6.90% | 25 | f | 0.0 | 0.0 | 4383.95 | 3287.98 | 4000.00 | 383.95 | 0.00 | 0.0 | 0.0 | 01-04-2009 | 2827.72 | NaN | 01-12-2012 | INDIVIDUAL | 0.0 |
| 39710 | 285386 | 285383 | 8000.0 | 8000.0 | 6525.00 | 36 months | 8.63% | 253.03 | A | A5 | Harland Electric | 10+ years | MORTGAGE | 54080.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=285386 | loan used to pay off credit cards and pay for home improvements | other | personal loan | 014xx | MA | 10.78 | 0.0 | 01-11-1998 | 1 | NaN | NaN | 7 | 0 | 5623.0 | 67.70% | 14 | f | 0.0 | 0.0 | 9045.35 | 7377.70 | 8000.00 | 1045.35 | 0.00 | 0.0 | 0.0 | 01-07-2010 | 2218.89 | NaN | 01-11-2012 | INDIVIDUAL | 0.0 |
| 39711 | 284637 | 284630 | 12000.0 | 12000.0 | 8425.00 | 36 months | 12.29% | 400.24 | C | C5 | LLC | 3 years | RENT | 90000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=284637 | I need this loan at a better rate to consolidate my debts and pay one low monthly installment. | credit_card | Consolidate Debts | 750xx | TX | 8.81 | 0.0 | 01-10-2001 | 1 | NaN | NaN | 13 | 0 | 15486.0 | 33.10% | 13 | f | 0.0 | 0.0 | 12586.96 | 8837.09 | 12000.00 | 586.96 | 0.00 | 0.0 | 0.0 | 01-10-2008 | 114.04 | NaN | 01-09-2008 | INDIVIDUAL | 0.0 |
| 39712 | 284207 | 284204 | 7500.0 | 7500.0 | 5387.50 | 36 months | 11.97% | 249.00 | C | C4 | SOCIAL SERVICES | < 1 year | OTHER | 19200.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=284207 | Need load to consolidate debt. | debt_consolidation | DEBT CONSOLIDATION | 908xx | CA | 10.94 | 0.0 | 01-12-2002 | 3 | NaN | NaN | 12 | 0 | 6450.0 | 70.90% | 13 | f | 0.0 | 0.0 | 8964.00 | 6395.15 | 7499.99 | 1464.01 | 0.00 | 0.0 | 0.0 | 01-03-2011 | 249.00 | NaN | 01-05-2016 | INDIVIDUAL | 0.0 |
| 39713 | 284136 | 284125 | 25000.0 | 25000.0 | 8933.60 | 36 months | 9.76% | 803.87 | B | B2 | Mayfield City School District | 7 years | MORTGAGE | 70000.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=284136 | I am trying to secure this loan to increase my investment portfolio. I have between $1000.00 to $1500.00 that I have been putting towards my investments, but with the market like it is, I would like to invest in more companies now and use the $1000.00 to $1500.00 of excess cash I make each month towards this loan. This way, I would have a decent amount of cash to invest in the coming months, and the payemnts on my loan would be under what I am investing per month currently, making it easy to pay back this loan. Of course, I know there is interest on this loan, but I believe that the I can realize gains in the market over the next three years that will be more than the interest I am paying for this loan. I also currently have no outstanding loans other than a morgage payment and have never carried an overdue balance or have had a late payment on any of my credit cards. | other | Investing | 440xx | OH | 4.92 | 0.0 | 01-11-1996 | 0 | NaN | NaN | 7 | 0 | 3114.0 | 11.30% | 13 | f | 0.0 | 0.0 | 28900.64 | 9975.25 | 25000.00 | 3900.64 | 0.00 | 0.0 | 0.0 | 01-01-2011 | 45.94 | NaN | 01-01-2011 | INDIVIDUAL | 0.0 |
| 39714 | 283826 | 283823 | 15000.0 | 15000.0 | 9019.30 | 36 months | 9.45% | 480.15 | B | B1 | US Army | 10+ years | RENT | 62400.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=283826 | pay off credit card and place some money in the savings | debt_consolidation | drodo | 136xx | NY | 6.46 | 0.0 | 01-11-1995 | 2 | NaN | NaN | 4 | 0 | 5196.0 | 34.60% | 14 | f | 0.0 | 0.0 | 17332.24 | 10261.03 | 15000.00 | 2308.23 | 24.01 | 0.0 | 0.0 | 01-02-2011 | 997.37 | NaN | 01-03-2011 | INDIVIDUAL | 0.0 |
| 39715 | 283707 | 211765 | 20000.0 | 20000.0 | 4031.29 | 36 months | 11.34% | 658.00 | C | C2 | Jada Beauty | 1 year | OTHER | 55000.0 | Not Verified | 01-03-2008 | Charged Off | n | https://lendingclub.com/browse/loanDetail.action?loan_id=283707 | This loan will be used to expand my growing business. We are the first in our state to specialize in Hair Threading and Henna tattoo. We have outgrown our mall location and need to expand to a bigger location to accommodate our growing clientele. This loan will be used to put money down on a bigger location and add new services too like manicures,facials, and pedicures with and Asian twist. I am a experinced licensed professional with over 20 year experince. My financial situation: I am a good candidate for this loan because I have worked very hard to have good credit and paying off my debt is very important to me. I will also be using my own savings in this venture | other | Expanding my growing Business | 852xx | AZ | 5.19 | 2.0 | 01-03-1985 | 0 | 21.0 | NaN | 8 | 0 | 5482.0 | 17.40% | 28 | f | 0.0 | 0.0 | 6767.46 | 1369.23 | 5044.60 | 1722.86 | 0.00 | 0.0 | 0.0 | 01-02-2009 | 658.00 | NaN | 01-09-2009 | INDIVIDUAL | 0.0 |
| 39716 | 283106 | 264548 | 11000.0 | 11000.0 | 9375.00 | 36 months | 12.29% | 366.89 | C | C5 | Kaiser Permanente | 4 years | MORTGAGE | 36400.0 | Not Verified | 01-03-2008 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=283106 | Want to pay off all credit card debt acquired while in college. | debt_consolidation | Operation Freedom From Debt | 925xx | CA | 11.60 | 0.0 | 01-01-2004 | 2 | NaN | NaN | 6 | 0 | 10765.0 | 60.50% | 9 | f | 0.0 | 0.0 | 13130.98 | 11191.18 | 11000.00 | 2130.98 | 0.00 | 0.0 | 0.0 | 01-09-2010 | 2508.69 | NaN | 01-10-2015 | INDIVIDUAL | 0.0 |